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NOVEL PROTEINS AND NUCLEIC ACIDS ENCODING SAME 

BACKGROUND OF THE INVENTION 

The invention generally relates to nucleic acids and polypeptides. More particularly, the 
invention relates to nucleic acids encoding novel molecules (MOL) polypeptides, as well as 
5 vectors, host cells, antibodies, and recombinant methods for producing these nucleic acids and 
po]ypeptides. 

SUMMARY OF THE INVENTION 

The invention is based in part upon the discovery of nucleic acid sequences encoding 
novel polypeptides. The novel nucleic acids and polypeptides are referred to herein as MOLX, 
10 or MOLL MOL2, MOL3, MOL4, MOL5, MOL6, MOL7, MOL8, MOL9, and MOL10 nucleic 
acids and polypeptides. These nucleic acids and polypeptides, as well as derivatives, homology 
analogs and fragments thereof, will hereinafter be collectively designated as "MOLX" nucleic 
acid or polypeptide sequences. 

* In one aspect, the invention provides an isolated MOLX nucleic acid molecule encoding 

15 a MOLX polypeptide that includes a nucleic acid sequence that has identity to the nucleic acids 
disclosed in SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29. In some 
embodiments, the MOLX nucleic acid molecule will hybridize under stringent conditions to a 
nucleic acid sequence complementary to a nucleic acid molecule that includes a protein-coding 
sequence of a MOLX nucleic acid sequence. The invention also includes an isolated nucleic acid 
20 that encodes a MOLX polypeptide, or a fragment, homolog, analog or derivative thereof. For 
example, the nucleic acid can encode a polypeptide at least 80% identical to a polypeptide 
comprising the amino acid sequences of SEQ ED NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, and 30. The nucleic acid can be, for example, a genomic DNA fragment or a cDNA 
molecule that includes the nucleic acid sequence of any of SEQ ID NOS: 1 , 3, 5, 7, 9, 1 1 , 13, 1 5, 
25 17, 19, 21, 23, 25, 27, and 29. 

Also included in the invention is an oligonucleotide, e.g., an oligonucleotide which 
includes at least 6 contiguous nucleotides of a MOLX nucleic acid (e.g. t SEQ ID NOS:l, 3,5,7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29) or a complement of said oligonucleotide. 

Also included in the invention are substantially purified MOLX polypeptides (SEQ ED 
30 NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30). In certain embodiments, the 

1 
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MOLX polypeptides include an amino acid sequence that is substantially identical to the amino 
acid sequence of a human MOLX polypeptide. 

The invention also features antibodies that immunoselectively bind to MOLX 
polypeptides, or fragments, homologs, analogs or derivatives thereof. 
5 In another aspect, the invention includes pharmaceutical compositions that include 

therapeutically- or prophylactically-effective amounts of a therapeutic and a pharmaceutically- 
acceptable carrier. The therapeutic can be, e.g., a MOLX nucleic acid, a MOLX polypeptide, or 
an antibody specific for a MOLX polypeptide. In a further aspect, the invention includes, in one 
or more containers, a therapeutically- or prophylactically-effective amount of this 
10 pharmaceutical composition. ' 

In a further aspect, the invention includes a method of producing a polypeptide by 
culturing a cell that includes a MOLX nucleic acid, under conditions allowing for expression of 
the MOLX polypeptide encoded by the DNA. If desired, the MOLX polypeptide can then be 
recovered. 

15 In another aspect the invention includes a method of detecting the presence of a MOLX 

polypeptide in a sample. In the method, a sample is contacted with a compound that selectively 
binds to the polypeptide under conditions allowing for formation of a complex between the 
polypeptide and the compound. The complex is detected, if present, thereby identifying the 
MOLX polypeptide within the sample. 

20 The invention also includes methods to identify specific cell or tissue types based on their 

expression of a MOLX. 

Also included in the invention is a method of detecting the presence of a MOLX nucleic 
acid molecule in a sample by contacting the sample with a MOLX nucleic acid probe or primer, 
and detecting whether the nucleic acid probe or primer bound to a MOLX nucleic acid molecule 

25 . in the sample. 

In a further aspect, the invention provides a method for modulating the activity of a 
MOLX polypeptide by contacting a cell sample that includes the MOLX polypeptide with a 
compound that binds to the MOLX polypeptide in an amount sufficient to modulate the activity 
of said polypeptide. The compound can be, e.g., a small molecule, such as a nucleic acid, 
30 peptide, polypeptide, peptidomimetic, carbohydrate, lipid or other organic (carbon containing) or 
inorganic molecule, as further described herein. 

Also within the scope of the invention is the use of a therapeutic in the manufacture of a 
medicament for treating or preventing disorders or syndromes including, e.g., diabetes, 
metabolic disturbances associated with obesity, the metabolic syndrome X, anorexia, wasting 

2 
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disorders associated with chronic diseases, metabolic disorders, diabetes, obesity, infectious 
disease, anorexia, cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's 
Disease, Parkinson's Disorder, immune disorders, and hematopoietic disorders, or other 
disorders related to cell signal processing and metabolic pathway modulation. The therapeutic 
5 can be, e.g., a MOLX nucleic acid, a MOLX polypeptide, or a MOLX-specific antibody, or 
biologically-active derivatives or fragments thereof. 

For example, the compositions of the present invention will have efficacy for treatment 
of patients suffering from: Cancer including pancreatic cancer, adenoma, brain tumor, colon 
cancer breast cancer, prostate cancer, testis cancer, neurological disorders including age-related 

10 disorders, Alzheimer's disease, Stroke, Parkinson's disease, Huntington's disease, Cerebral palsy, 
Epilepsy, Behavioral disorders, Addiction, Anxiety, Pain, nephropathy, neurodegenerative 
disorders, Aneurysms, Fibromuscular dysplasia, metabolic disorders including, failure to thrive, 
nutritional edema, hypoproteinemia, trypsinogen deficiency disease, chronic and heriditary 
pancreatitis, enterkinase defieciency, Hypercholesterolemia, Obesity, Diabetes, cardiac disorders 

1 5 inclusing tachycardia, erythroderma, long QT syndrome, heart block, Ataxia-telangiectasia, 

Cardiomyopathy, Atherosclerosis, Hypertension, Congenital heart defects, Aortic stenosis, Atrial 
septal defect (ASD), Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, 
Subaortic stenosis, Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night 
blindness, brugada syndrome, Von Hippel-Iindau (VHL) syndrome, Tuberous sclerosis, 

20 Hypercalcemia, Cirrhosis, angiogenesis and wound healing, blood pressure regulation, Trauma, 
Tuberous sclerosis, Fertility, Hirschsprung's disease, Renal artery stenosis, Interstitial nephritis, 
Glomerulonephritis, Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, 
immune disorders including cell-mediated immunity disorders, Leukodystrophies, inflammation, 
Hyperthyroidism, Hypothyroidism, Lesch-Nyhan syndrome, Multiple sclerosis, Transplantation, 

25 lung diseases, including asthma, Emphysema, immundeficiencies, Crohn's disease, Scleroderma, 
Appendicitis, Autoimmune diseases, Systemic lupus erythematosus, developmental disorders, 
neural tube defects, modulation of apoptosis, viral, bacterial, and parasitic infections and/or other 
pathologies and disorders of the like. 

The polypeptides can be used as immunogens to produce antibodies specific for the 

30 invention, and as vaccines. Hiey can also be used to screen for potential agonist and antagonist 
compounds. For example, a cDNA encoding MOLX may be useful in gene therapy, and MOLX 
may be useful when administered to a subject in need thereof. By way of nonlimiting example, 
the compositions of the present invention will have efficacy for treatment of patients suffering 
from Cancer including pancreatic canceT, adenoma, brain tumor, colon cancer breast cancer, 
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prostate cancer, testis cancer, neurological disorders including age-related disorders, Alzheimer's 
disease, Stroke, Parkinson's disease, Huntington's disease, Cerebral palsy, Bpilepsy, Behavioral 
disorders, Addiction, Anxiety, Pain, nephropathy, neurodegenerative disorders, Aneurysms, 
Fibromuscular dysplasia, metabolic disorders including, failure to thrive, nutritional edema, 
5 hypoproteincmia, tripsinogen deficiency disease, chronic and heriditary pancreatitis, enterkinase 
deficiency, Hypercholesterolemia, Obesity, Diabetes, cardiac disorders inclusing tachycardia, 
eiythroderma, long QT syndrome, heart block, Ataxia-telangiectasia, Cardiomyopathy, 
Atherosclerosis, Hypertension, Congenital heart defects, Aortic stenosis, Atrial septal defect 
(ASD), Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, Subaortic 

10 stenosis, Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night blindness, 
brugada syndrome, Von Hippel-Lindau (VHL) syndrome, Tuberous sclerosis, Hypercalceimia, 
Cirrhosis, angiogenesis and wound healing, blood pressure regulation, Trauma, Tuberous 
sclerosis, Fertility, Hirschsprung's disease, Renal artery stenosis, Interstitial nephritis, 
Glomerulonephritis, Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, 

1 5 immune disorders including cell-mediated immunity disorders, Leukodystrophies, inflammation, 
Hyperthyroidism, Hypothyroidism, Lesch-Nyhan syndrome, Multiple sclerosis, Transplantation, 
lung diseases, including asthma, Emphysema, immundeficiencies, Crohn's disease, Scleroderma, 
Appendicitis, Autoimmune diseases, Systemic lupus erythematosus, developmental disorders, 
neural tube defects, modulation of apoptosis, viral, bacterial, and parasitic infections and/or other 

20 pathologies and disorders. 

The invention further includes a method for screening for a modulator of disorders or 
syndromes including, e.g., Cancer including pancreatic cancer, adenoma, brain tumor, colon 
cancer breast cancer, prostate cancer, testis cancer, neurological disorders including age-related 
disorders, Alzheimer's disease, Stroke, Parkinson's disease, Huntington's disease, Cerebral palsy, 

25 Epilepsy, Behavioral disorders, Addiction, Anxiety, Pain, nephropathy, neurodegenerative 

disorders, Aneurysms, Fibromuscular dysplasia, metabolic disorders including, failure to thrive, 
nutritional edema, hypoproteinemia, tiypsinogen deficiency disease, chronic and heriditary 
pancreatitis, enterkinase defieciency, Hypercholesterolemia, Obesity, Diabetes, cardiac disorders 
inclusing tachycardia, erythroderma, long QT syndrome, heart block, Ataxia-telangiectasia, 

30 Cardiomyopathy, Atherosclerosis, Hypertension, Congenital heart defects, Aortic stenosis, Atrial 
septal defect (ASD), Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, 
Subaortic stenosis, Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night 
blindness, brugada syndrome, Von Hippel-Lindau (VHL) syndrome, Tuberous sclerosis, 
Hypercalceimia, Cirrhosis, angiogenesis and wound healing, blood pressure regulation, Trauma, 

4 
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Tuberous sclerosis, Fertility, Hirschsprung's disease, Renal artery stenosis, Interstitial nephritis, 
Glomerulonephritis, Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, 
immune disorders including cell-mediated immunity disorders, Leukodystrophies, inflammation, 
Hyperthyroidism, Hypothyroidism, Lesch-Nyhan syndrome, Multiple sclerosis, Transplantation, 
5 lung diseases, including asthma, Emphysema, immundeficiencies, Crohn's disease, Scleroderma, 
Appendicitis, Autoimmune diseases, Systemic lupus erythematosus, developmental disorders, 
neural tube defects, modulation of apoptosis, viral, bacterial, and parasitic infections or other 
disorders related to cell signal processing and metabolic pathway modulation. The method 
includes contacting a test compound with a MOLX polypeptide and determining if the test 

1 0 compound binds to said MOLX polypeptide. Binding of the test compound to the MOLX 

polypeptide indicates the test compound is a modulator of activity, or of latency or predisposition 
to the aforementioned disorders or syndromes. 

Also within the scope of the invention is a method for screening for a modulator of 
activity, or of latency or predisposition to an disorders or syndromes including, e.g. , Cancer 

1 5 including pancreatic cancer, adenoma, brain tumor, colon cancer breast cancer, prostate cancer, 
testis cancer, neurological disorders including age-related disorders, Alzheimer's disease, Stroke, 
Parkinson's disease, Huntington's disease, Cerebral palsy, Epilepsy, Behavioral disorders, 
Addiction, Anxiety, Pain, nephropathy, neurodegenerative disorders, Aneurysms, Fibromuscular 
dysplasia, metabolic disorders including, failure to thrive, nutritional edema, hypoproteinemia, 

20 trypsinogen deficiency disease, chronic and heriditary pancreatitis, enterkinase defieciency, 

Hypercholesterolemia, Obesity, Diabetes, cardiac disorders inclusing tachycardia, erythroderma, 
long QT syndrome, heart block, Ataxia-telangiectasia, Cardiomyopathy, Atherosclerosis, 
Hypertension, Congenital heart defects, Aortic stenosis, Atrial septal defect (ASD), 
Atrioventricular (A-V) canal defect, Ductus arteriosus, Pulmonary stenosis, Subaortic stenosis, 

25 Ventricular septal defect (VSD), valve diseases, Larsen syndrome, night blindness, brugada 

syndrome, Von Hippel-Lindau (VHL) syndrome, Tuberous sclerosis, Hypercalceimia, Cirrhosis, 
angiogenesis and wound healing, blood pressure regulation, Trauma, Tuberous sclerosis, 
Fertility, Hirschsprung's disease, Renal artery stenosis, Interstitial nephritis, Glomerulonephritis, 
Polycystic kidney disease, Renal tubular acidosis, IgA nephropathy, immune disorders including 

30 cell-mediated immunity disorders, Leukodystrophies, inflammation, Hyperthyroidism, 

Hypothyroidism, Lesch-Nyhan syndrome, Multiple sclerosis, Transplantation, lung diseases, 
including asthma, Emphysema, immundeficiencies, Crohn's disease, Scleroderma, Appendicitis, 
Autoimmune diseases, Systemic lupus erythematosus, developmental disorders, neural tube 
defects, modulation of apoptosis, viral, bacterial, and parasitic infections or other disorders 

5 
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related to cell signal processing and metabolic pathway modulation by administering a test 
compound to a test animal at increased risk for the aforementioned disorders or syndromes. The 
test animal expresses a recombinant polypeptide encoded by a MOLX nucleic acid. Expression 
or activity of MOLX polypeptide is then measured in the test animal, as is expression or activity 
5 of the protein in a control animal which recombinantly-expresses MOLX polypeptide and is not 
at increased risk for the disorder or syndrome. Next, the expression of MOLX polypeptide in 
both the test animal and the control animal is compared. A change in the activity of MOLX 
polypeptide in the test animal relative to the control animal indicates the test compound is a 
modulator of latency of the disorder or syndrome. 

10 In yet another aspect, the invention includes a method for determining the presence of or 

predisposition to a disease associated with altered levels of a MOLX polypeptide, a MOLX 
nucleic acid, or both, in a subject (e.g., a human subject). The method includes measuring the 
amount of the MOLX polypeptide in a test sample from the subject and comparing the amount of 
the polypeptide in the test sample to the amount of the MOLX polypeptide present in a control 

1 5 sample. An alteration in the level of the MOLX polypeptide in the test sample as compared to 
the control sample indicates the presence of or predisposition to a disease in the subject. 
Preferably, the predisposition includes, e.g., diabetes, metabolic disturbances associated with 
obesity, the metabolic syndrome X, anorexia, wasting disorders associated with chronic diseases, 
metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated cachexia, 

20 cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune 

disorders, and hematopoietic disorders. Also., the expression levels of the new polypeptides of 
the invention can be used in a method to screen for various cancers as well as to determine the 
stage of cancers. 

In a further aspect, the invention includes a method of treating or preventing a 
25 pathological condition associated with a disorder in a mammal by administering to the subject a 
MOLX polypeptide, a MOLX nucleic acid, or a MOLX-specific antibody to a subject (e.g. 9 a 
human subject), in an amount sufficient to alleviate or prevent the pathological condition. In 
preferred embodiments, the disorder, includes, e.g. 9 Cancer including pancreatic cancer, 
adenoma, brain tumor, colon cancer breast cancer, prostate cancer, testis cancer, neurological 
30 disorders including age-related disorders, Alzheimer's disease, Stroke, Parkinson's disease, 

Huntington's disease, Cerebral palsy, Epilepsy, Behavioral disorders, Addiction, Anxiety, Pain, 
nephropathy, neurodegenerative disorders, Aneurysms, Fibrornuscular dysplasia, metabolic 
disorders including, failure to thrive, nutritional edema, hypoproteinemia, trypsinogen deficiency 
disease, chronic and heriditary pancreatitis, enterkinase defieciency, Hypercholesterolemia, 

6 
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Obesity, Diabetes, cardiac disorders inclusing tachycardia, erythroderma, long QT syndrome, 
heart block, Ataxia-telangiectasia, Cardiomyopathy, Atherosclerosis, Hypertension, Congenital 
heart defects, Aortic stenosis, Atrial septal defect (ASD), Atrioventricular (A-V) canal defect, 
Ductus arteriosus, Pulmonary stenosis, Subaortic stenosis, Ventricular septal defect (VSD), valve 
5 diseases, Larsen syndrome, night blindness, brugada syndrome, Von Hippel-Lindau (VHL) 
syndrome, Tuberous sclerosis, Hypercalceimia, Cirrhosis, angiogenesis and wound healing, 
blood pressure regulation, Trauma, Tuberous sclerosis, Fertility, Hirschsprung's disease, Renal 
artery stenosis, Interstitial nephritis, Glomerulonephritis, Polycystic kidney disease, Renal 
tubular acidosis, IgA nephropathy, immune disorders including cell-mediated immunity 

10 disorders, Leukodystrophies, inflammation, Hyperthyroidism, Hypothyroidism, Lesch-Nyhan 
syndrome, Multiple sclerosis, Transplantation, lung diseases, including asthma, Emphysema, 
immundeficiencies, Crohn's disease, Scleroderma, Appendicitis, Autoimmune diseases, Systemic 
lupus erythematosus, developmental disorders, neural tube defects, modulation of apoptosis, 
viral, bacterial, and parasitic infections. 

15 In yet another aspect, the invention can be used in a method to identity the cellular 

receptors and downstream effectors of the invention by any one of a number of techniques 
commonly employed in the art. These include but are not limited to the two-hybrid system, 
affinity purification, co-precipitation with antibodies or other specific-interacting molecules. 
Unless otherwise defined, all technical and scientific terms used herein have the same 

20 meaning as commonly understood by one of ordinary skill in the art to which this invention 

belongs. Although methods and materials similar or equivalent to those described herein can be 
used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references mentioned 
herein are incorporated by reference in their entirety. In the case of conflict, the present 

25 specification, including definitions, will control. In addition, the materials, methods, and 
examples are illustrative only and not intended to be limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION OF THE INVENTION 

30 The invention is based, in part, upon the discovery of novel nucleic acid sequences that 

encode novel polypeptides. The novel nucleic acids and their encoded polypeptides are referred 
to individually as MOL1, MOL2, MOL3, MOL4, MOL5, MOL6, MOL7, MOL8, MOL9, and 
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MOL10. The nucleic acids, and their encoded polypeptides, are collectively designated herein as 
"MOLX". 

The novel MOLX nucleic acids of the invention include the nucleic acids whose 
sequences are provided in Tables 1 A, 2A, 3A, 4A, 5A, 6A, 6D, 7A, 8A, 8D, 9A, 9D, 9F, and 
10A. inclusive ("Tables 1 A - 10A"), or a fragment, derivative, analog or homolog thereof. The 
novel MOLX proteins of the invention include the protein fragments whose sequences are 
provided in Tables IB, 2B, 3B, 4B 5B, 6B, 6E, 7B, 8B, 8E, 9B, 9E, 9G and 10B inclusive 
("Tables IB - 10B"). The individual MOLX nucleic acids and proteins are described below. 
Within the scope of this invention is a method of using these nucleic acids and peptides in the 
treatment or prevention of a disorder related to cell signaling or metabolic pathway modulation. 



MOL1 



A disclosed interleukin-1 receptor/Toll-like nucleic acid of 1050 nucleotides, MOL1, is 
shown in Table 1 A. The disclosed MOL1 open reading frame ("ORF") begins at the ATG 
initiation codon at nucleotides 1-3, shown in bold in Table 1A. The encoded polypeptide is 
1 5 alternatively referred to herein as MOL1 or as GMJ799601 78. The disclosed MOL1 ORF 

terminates at a TGA codon at nucleotides 3043-3045. As shown in Table 1 A the start and stop 
codons are in bold letters. 



Table 1A. MOL1 nucleotide sequence (SEQ ID NO:l). 

ATGCTGGCCATGACCCTGGCCCTGGGTACCTTGCCTGCCTTCCrACCCTGTGAGCrcCAGCCCCACGGCCTGGTGAA 

CTGCAACTGGCTGTTCCTGAAGTCTGTGCCCCACTTCTGCATGGCAGC^CCCCGTGGCAATGTCACCAGCCTTTCCT 

TGTCCTCCAACCGCATCCACCACCTCCATGATTCTGACTTTGCCCACCTGCCCAGCCTGCGGCATCTCAACCTCAAG 

TGGAACTGCCCGCCGGTTGGCCTCAGCCCCATGCACTTCCCCTGCCACATGACCATCGAGCCCAGCACCTTCTTGGC 

TGTGCCCACXCTGGAAGAGCrAAACCTGAGCTACAACAACATCATGACTGTGCCTGCGCTGCCCAAATCCCTCATA^ 

CCCTGTCCCTCAGCCATACCAACATCCTGATGCTAGACTCTGC<^GCCTCGCCGGCCTGCATGCCCTGCGCTTCCTA 

TTCATGGACGGCAACTGTTATTACAAGAACCCCTGCAGGCAGGCACTGGAGGTGGCCCCGGGTGCCCTCCTTGGCCT 

GGGCAACCI^CCCACCTGTCACTCAAGTACAACAACCTCACTGTGGTGCCCCGCAACCTGCCTTCCAGCCTGGAGT 

ATCTGCTGrTGTCCTACAACCGCATCGTCAAACTGGCGC 

GATGTGGGCGGAAATTGCCGCCGCTGCGACCACGCTCCCAACCCC 

ACATCCCGATACCTTCAGCCACCTGAGCCGTCTTGAAGGCCTGCT 

CCAGTTGGTTCCGTGGGCTGGGAAACCTCCGAGTGCTGGACCTGAG 

ACCAAGGCC1TCCAGGGCCTAACACAGCTGCGCAAGCTTAACCTGTCCTT 

CCACCTGTCTCTGGCCCCTTCCl^CGGGAGCCTGGTCGCCCTGAAGGAGCTGGACATGCACGGCATCTTCTTCX^^ 
CACTCGATGAGACCACGCTCCGGCCACTGGCCCGCCTGCCCATGCTCCAGACTCTGCGTCTGCAGATGAACTTCATC 
AACCAGGCCC^GCTCGGCATCTTQIGGGCCTTCCCTGGCCTGCGCT 
AGCTTCGGAGCTGACAGCC^CC^TGSGGGAGGCAGATGGAGGGGAGAA 

CGGCCCCAGTGGACACTCCCAGCTCTGAAGACTTCAGGCCCAACTGCAGCACCCTCAACTTCACCTTGGATCrGTCA 

CGGAACAACCTGGTGACCGTGCAGCCGGAGATGTTTGCCCAGCTCTCGCACCTGCAGTGCCTGCGCCTGAGCCACAA 

CTGCATCTCGCAGGCAGTCAATGGCTCCCAGTTCCT(3CCGCTGACCGGTCTGCAGGTGCTAGACCTGTCC(^CAATA 

AGCTGGACCTCTACCACGAGCACTCATTCACGGAGCTACCACGACTGGAGGCCCTGGACCTCAGCTACAACAGCCAG 

CCCTTTGGCATGCAGGGCGTGGGCCACAACTTCAGCTTCGTGGCTCACCTGCGCACCCTGCGCCACCT 

CCACAACAACATCC&CAGCCAAGPrGTCCCAGCAGCTCTGC^ 

CACTGGGCC ATAT GTGGGCCG AGGGAGACCTCTATCTGCACT TCTTCC AAGGCCTG AGCGGTTTGAT CTGGCTGGAC 

TTGTCCCAGAACCGCCTGC^CACCCTCCT^CCCCAAACCCT^ 

CCGTGACAATTACCTGGCCTTCTTTAACTrGGTGGAGCCTCCACTTCC^ 

GAAACCAGCTGAAGGCCCTGACCAATGGCAGCCTGCCTGCTGGCACCCGGCTCCGGAGGCTGGATGTCAGCT 
AGCATCAGCTTCGTGGCCCCCGGCTTCTTrTCCAAGGCCA 

CAAGACAGTGGACCACTCCTGGTTTGGGCCCCTGGCGAGTGCCCTGCAAATACTAGATGTAAGCGCCAACCCTCTGC 
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ACTGCGCCTGTGGGGCGGCCTTTATGGACTTCCTGCTGGAG3TGCAGGCTGCCGTGCCCGGTCTGCCCAGCCGGGTG 
AAGTGTGGCAGTCCGGGCCAGCTCCAGGGCCTCAGCATCTTTGCACAGGACCTGCGCCTCTGCCTGGATGAGGCCCT 
(:TCCTGGGACTGTTTCGCCC7CTCGCrGCl'GGCTGTGGCTCTGGGCCTGGG?GTGCCCATGCTGCA7CACCTCTGTG 
GCTCGGACCTCTGGTACTGCTTCCACCTGlTXXTGGCCTGGCT^ 

GATGCCCTGCCCTACGATGCClTCG'rGG'rCTTCGACAAAACGCAGAGCGCAGTGGCAGACTGGGTGTACAACGAGCT 
TCGGGGGCAGCTGGAGGAGTGCCGTGGGCGCTGGGCACTCCGCCTGTGCCTGGAGGAACGCGACTGGCTGCCTGGCA 
AAACCCTC1TT(^GAACCTGTGGGCCTCGGTCTATGGCAGCCGCAAGACGCTGTTTGTGCTGGCCCACACGGACCGG 
GTCAGTGGTCTCTTC^GCGCCAGCTTCCTGCTGGCCCAGCAGCGCCTGCTGGAGGACCGCAAGGACGTCGTGGTGCT 
GGTGATCCTGAGCCCTGACGGCCGCCGCTCCCGCTATGTGCGGCTGCGCCAGCGCCrCTGCCGCCAGAGTGTCCTCC 
TCTGGCCCCACCAGCCCAGTGGTCAGCGCAGCTTCTGGGCCCAGCTGGGCATGGCCCTGACCAGGGACAACCACCAC 
TTCTATAACCGGAACTTCTGCCAGGGACCCACGGCCGAATAG 



A disclosed encoded MOL1 protein has 346 amino acid residues, referred to as the 
MOL1 protein. The MOL1 protein was analyzed for signal peptide prediction and cellular 
localization. SignalP results predict that MOL1 is cleaved between position 41 and 42 of SEQ 
5 ID NO:2. Psort and Hydropathy profiles also predict that MOL1 contains a signal peptide and is 
likely to be localized in the plasma membrane (Certainty=0.4600). The disclosed MOL1 
polypeptide sequence is presented in Table IB using the one-letter amino acid code. 



Table IB. Encoded MOL1 protein sequence (SEQ ID NO:2). 

MLAMTLAIjGTLPAFLPCELQPHGLVNCfcW 

WNCP PVGLS PMH F PCHMT I E PST FLAVPTLEELN L^^ 

FMDGNCYYKNPCRQALEVAPGALLGLGNLTHLSLKTNN^^ 

DVGGNCRRCDHAPNPCMECPRHFPQLHPDTFSHLSRLEGLVLKDSSLSWLNASHFRGLGNLRVLDLSENFLYKCITK 
T KAFQGLT QLRKLNLS FN YQKRV S FAHLS LAP £ FG S L V ALKE LDMHG I FFRS L DET TLR FLARL PMLQT LRLQMN F I 
NQAQLGIFRAFPGLRYVDLSDNRISGA^ELTATMGEADGGEKVWWPGDLAPAPVDTPSSEDFRPNCSTIiNFTLDLS 
RNNLVTVQPEMFACLSHLQCLRLSHNCISQAVKGSQFLPLTGLQVLDLSHNKLDLYHEHSFTELPRLEALDLSYNSQ 
P FGMQG VG HN FS FVAHLRT LR H LS LAHNN I HS QVSQQLC S T S LRA1 DFS GN ALGHMW AEG DL YLH F FQG LS G LI WL D 
LSQNRLHTLLPQTLRNXPKSLQVLRLRDNYLAFFKWWSLH^^ 
SISEVAPGFFSKAKELRELNLSANALKTVDHSKFGPIASALQI^ 
KCGSPGQLQGLSIFAQDLRLCLDEALSWDCFALSLIAVALGLGVP^K^ 

DALPYDAFWFDKTQSAVADWVYNELRGQLEECRGRVJALRLCLEERDKLPGKTLFENLWASVYGSRKTLFVLAHTDR 
VSGLLRAS FLLAQQRLLE DRKDVWLVI LSP DGRRSRYVRLRQRLCRQS VLLW PHQPSGQRS FWAQLGMALTRDNHH 
FYNRNFCQGPTAE 



10 MOL1 was initially identified on chromosome 3 with a TblastN analysis of a proprietary 

sequence file for a G-protein coupled receptor probe or homolog, which was run against the 
Genomic Daily Files made available by GenBank. A proprietary software program (GenScan™) 
was used to further predict the nucleic acid sequence and the selection of exons. The resulting 
sequences were further modified by means of similarities using BLAST searches. The 

15 sequences were then manually corrected for apparent inconsistencies, thereby obtaining the 
sequences encoding the full-length protein. 

A region of the MOL1 nucleic acid sequence has 690 of 1203 bases (57 %) identical to a 
Homo sapiens Toll Receptor mRNA (GENBANK-ID: AL1 37451), with an E-value of 5.7x10"*. 
In all BLAST alignments herein, the "E-value" or "Expect" value is a numeric indication of the 

20 probability that the aligned sequences could have achieved their similarity to the BLAST query 

sequence by chance alone, within the database that was searched. For example, the probability 
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that the subject ("Sbjcf ) retrieved from the M0L1 BLAST analysis, e.g., the Homo sapiens 
MOL, matched the Query M0L1 sequence purely by chance is 5.7x1 0" 8 . 

A BLASTX search was performed against public protein databases. The full amino acid 
sequence of the protein of the invention was found to have 342 of 900 amino acid residues (38%) 
5 identical to, and 493 of 900 residues (54%) positive with, the 1049 amino acid residue Toll-like 
Receptor 7 protein from Homo sapiens (ptnr:SPTREMBL-ACC: AAF60 1 88) . 

The amino acid sequence of MOL1 also had high homology to other proteins as shown in 
table 1C. 



Table 1C. BLAST results for MOL1 


Gene Index/ 
Identifier 


Protein/ 
Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


Patp:W86365 


DNAX toil-like 
receptor DTLR^ 
10 [Homo 
sapiens] 


336 


335/336 
(99%) 


335/336 
(99%) 


0.0 


gi|8394456|ref |NP 
_059138.1I 


toll-like 
receptor 9 
[Homo sapiens] 


1032 


960/1014 
(94%) 


960/1014 
(94%) 


0.0 


gi|1364B665|xer|X 
P_003236.2| 


toll-like 
receptor 9 
[Homo sapiens] 


1014 


960/1014 
(94%) 


960/1014 
(94%) 


0.0 


gi|8099654lgb|AAF 
72190. 1IAF259263 
1 


toll-like 
receptor 9 form 
B [Homo 
sapiens] 


975 


921/975 
(94%) 


921/975 
(94%) 


0.0 


gi|13507173|gb|AA 
K28488.1IAF314224 
1 


toll-like 
receptor 9 [Mus 
mus cuius] 


1032 


720/1015 
(70%) 


799/1015 
(77%) 


0.0 



10 

A ClustalW analysis comparing disclosed proteins of the invention with related OR 
protein sequences is given in Table ID, with MOL1 shown on line 1. 

In the ClustalW alignment of the MOL1 protein, as well as all other ClustalW analyses 
herein, the black outlined amino acid residues indicate regions of conserved sequence (z.e., 

15 regions that may be required to preserve structural or functional properties), whereas non- 
highlighted amino acid residues are less conserved and can potentially be mutated to a much 
broader extent without altering protein structure or function. Residue differences between any 
MOLX variant sequences herein are written to show the residue in the "a** variant and the residue 
position with respect to the "a" variant. MOL residues in all following sequence alignments that 

20 differ between the individual MOL variants are highlighted with a box and marked with the (o) 
symbol above the variant residue in all alignments herein. 
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Table ID. ClustalW Analysis of MOL1 

1) Novel MOL1 (SEQ ID NO:2) 

2) gi|S3944561ref |NP_059138.1| toll-like receptor 9 [Homo sapiens) (SEQ ID 
NO: 31) 

5 3) g.i|13648665|ref ]XP 003236.21 toll-like receptor 9 [Homo sapiens] (SEQ ID 

NO:32) 

4) gi 18099654 | gblAAF72190. 1 1 AF259263_1 toll-like receptor 9 fonr. B [Homo 
sapiens] (SEQ ID KO:33) 

5) gi |13507173|gb|AAK28488.1|AF314224_i toll-like receptor 9 [Mus musculus] 
10 (SEQ ID NO: 34) 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 




PROl 

NP_059138. 
XP_003236. 

AF259263_1 

AF314224_1 MVLRRRTLHP LSLLVQAAVjJ B &MM^tt 



PROl 

NP_059138. 
XP_003236. 
AF259263_1 
AF314224 1 



PROl 

NP_059138. 
XP_003236 . 
AF259263_1 
AF314224 1 



PROl 

NP 059138. 
XF~003236 . 
AF259263_1 
AF314224 1 



PROl 

NP_059138 
XP_003236 
AF259263_* ™ 
AF314224_1 gKj 



PROl 

NP 059138. 
XPJ>03236. 
AF259263 1 
AF314224~1 




342 
360 
342 
303 

pjgKgg E^B^ffisBS 360 



E'AHL SLAPS I' 
fAHL SLAPS F 
FAHLSLAP3F 
rAHLSLAPSF 



PROl 

NPJ)5913B. 
XP~003236. 
AF25 9263_1 
AF314224 1 



PROl 

NP_059138. 
XP_003236. 
AF259263_1 
AF314224 1 




.1- 



390 

.1 . 



400 
■ .1 



ETTLRPLARLB PMLQTLRLQMKFI 

ErrLRPLARil pmlqtlrlqmKfi 
sitiiRpiari|pmlqtlrlqmBe'I 
ettlrplarJpmlqtlrlqm^fi 

ft 



430 



440 



450 



460 





490 



500 



510 



520 



530 



540 



11 



BNSDCCID: <WO__01B1678A2_I_> 



WO 01/81578 



PCT/US01/13578 



10 



15 



20 



25 



30 



35 



40 



45 




PROl 

NP_059138. 
XP_003236. 
AF259263 1 
AF314224~1 



PROl 

NP_059138. 
XP_003236. 
AF259263 1 
AF314224~1 



PROl 

NP_059138. 
XP_003236. 
AF259263__1 
AF314224 1 



PROl 

NP_059138 . 
XP_003236 . 
AF259263_1 
JLT314224 1 



PROl 

NF_059138. 
XP_003236. 
AF259263_1 
AF314224 1 



610 

.1 



620 



630 



640 
.1 



650 
..I . 



660 
• I 




50 



55 



60 



65 



70 



75 



PROl 




3 WD 


LI 


NP_059138. 






LI 


XP_003236. 






LI 


AF259263 1 




3RD 


LI 


AF3 14224*1 





PROl 

HP_059138 . 
XP_003236 . 
AT259263_1 
AF314224 1 



PROl 

MP_059138. 
XP_0 03236. 
AT2592631 
AF314224 1 



PROl 

NP 059138. 
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XP_003236. 53^PS5*|B SS 1014 
AF259263_1 §g^ g|f 975 

AF314224 1 ^MQN^n^ jgi 1032 



5 

The interleukin-1 (IL-1) receptor/Toll-like receptor (TLR) superfamily is a recently 
defined and expanding group of receptors that participate in host responses to injury and 
infection. The superfamily is defined by the Toll/EL-l receptor (TTR) domain, which occurs in 
the cytosolic region of family members, and is further subdivided into two groups b§sed on 

10 homology to either the Type I IL-1 receptor or Drosophila Toll receptor extracellular domain. 
The former group includes the receptor for the important Thl cytokine IL-1 8, and T1/ST2, 
which may have a role in Th2 cell function. The latter group includes six mammalian TLRs, 
including TLR2 and TLR4, that largely mediate the host response to gram-positive and gram- 
negative bacteria, respectively. Whether bacterial products are actual ligands for TLRs, or 

1 5 whether they generate ligands via as yet unidentified pattern recognition receptors, has yet to be 
determined. Signaling pathways activated via the TTR domain trigger the activation of 
downstream kinases, and transcription factors such as NF-kappaB, and involve the adaptor 
protein MyD88, which itself contains a HR domain. 

As our primary interface with the environment, the skin is constantly subjected to injury 

20 and invasion by pathogens. The fundamental force driving the evolution of the immune system 
has been the need to protect the host against overwhelming infection. The ability of T and B cells 
to recombine antigen receptor genes during development provides an efficient, flexible, and 
powerful immune system with nearly unlimited specificity for antigen. The capacity to expand 
subsets of antigen-specific lymphocytes that become activated by environmental antigens 

25 (memory response) is termed "acquired" immunity. Immunologic memory, although a 

fundamental aspect of mammalian biology, is a relatively recent evolutionary event that permits 
organisms to live for years to decades. "Innate" immunity, mediated by genes that remain in 
germ line conformation and encode for proteins that recognize conserved structural patterns on 
microorganisms, is a much more ancient system of host defense. Defensins and other 

30 antimicrobial peptides, complement and opsonins, and endocytic receptors are all considered 
components of the innate immune system. None of these, however, are signal-transducing 
receptors. Most recently, a large family of cell surface receptors that mediate signaling through 
the NF-kappaB transcription factor has been identified. This family of proteins shares striking 
homology with plant and Drosophila genes that mediate innate immunity. In mammals, this 

35 family includes the type I interleukin-1 receptor, the interleukin-1 8 receptor, and a growing 
family of Toll-like receptors, two of which were recently identified as signal-transducing 
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receptors for bacterial endotoxin. In this review, we discuss how interleukin-1 links the innate 
and acquired immune systems to provide synergistic host defense activities in skin. 

In Drosophila the Toll protein is involved in establishment of dorso-ventral polarity in 
the embryo. In addition, members of the Toll family play a key role in innate antibacterial and 
5 antifungal immunity in insects as well as in mammals. These proteins are type -I transmembrane 
receptors that share an intracellular 200 residue domain with the interleukin-1 receptor (IL-1R), 
the Toll/IL-IR homologous region (TIR). The similarity between Toll-like receptors (LTRs) and 
IL-1R is not restricted to sequence homology since these proteins also share a similar signaling 
pathway. They both induce the activation of a Rel type transcription factor via an adaptor protein 

1 0 and a protein kinase. Interestingly, MyD88, a cytoplasmic adaptor protein found in mammals, 
contains a TIR domain associated to a DEATH domain (see IPR000488 ). Besides the 
mammalian and Drosophila proteins, a HR domain is also found in a number of plant proteins 
implicated in host defense. As MyD88, these proteins are cytoplasmic. Site directed mutagenesis 
and deletion analysis have shown that the TIR domain is essential for Toll and IL-1R activities. 

1 5 Sequence analysis have revealed the presence of three highly conserved regions among the 
different members of the family: box 1 (FDAFISY), box 2 (GYKLC-RD-PG), and box 3 (a 
conserved W surrounded by basic residues). It has been proposed that boxes 1 and 2 are involved 
in the binding of proteins involved in signaling, whereas box 3 is primarily involved in directing 
localization of receptor, perhaps through interactions with cytoskeletal elements 

20 Toll is a Drosophila gene essential for ontogenesis and antimicrobial resistance. Several 

hortologues of Toll have been identified and cloned in vertebrates, namely Toll-like receptors 
(TLR). Human TLR are a growing family of molecules involved in innate immunity. TLR are 
structurally characterized by a cytoplasmic Toll/interleukin-lR (TTR) domain and by 
extracellular leucine-rich repeats. TLR characterized so far activate the MyDS8/IRAK signaling 

25 cascade, which bifurcates and leads to NF-kappaB and c-Jun/ATF2/TCF activation. Genetic, 
gene transfer, and dominant-negative approaches have involved TLR family members (TLR2 
and TLR4) in lipopolysaccharide recognition and signaling. Accumulating evidence suggests 
that some TLR molecules are also involved in signaling receptor complexes that recognize 
components of gram-positive bacteria and mycobacteria. However, the definitive role of other 

30 TLR is still lacking. A systematic approach has been used to determine whether different human 
leukocyte populations selectively or specifically expressed TLR rnRNA. Based on expression 
pattern, TLR can be classified as ubiquitous (TLR1), restricted (TLR2, TLR4, and TLR5), and 
specific (TLR3). Expression and regulation of distinct though overlapping ligand recognition 
patterns may underlie the existence of a numerous, seemingly redundant, TLR family. 
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Alternately, the expression of a TLR in a single cell type may indicate a specific role for this 
molecule in a restricted setting. 

The amino acids differences between the three MOL1 proteins are shown in Tabic 1H. 
Deletions are marked by a delta (A). The differences between the three proteins appear to be 

5 localized to a few distinct regions. Thus, these proteins may have similar functions, such as 
serving as olfactory or chemokine receptors. 
Uses of the Compositions of the Invention 

The above defined information for this invention suggests that this Toll Receptor-like 
protein may function as a member of a 'Toll Receptor family"- Therefore, the novel nucleic 

10 acids and proteins identified here may be useful in potential therapeutic applications implicated 
in (but not limited to) various pathologies and disorders as indicated below. The potential 
therapeutic applications for this invention include, but are not limited to: protein therapeutic, 
small molecule drug target, antibody target (therapeutic, diagnostic, drug targeting/cytotoxic 
antibody), diagnostic and/or prognostic marker, gene therapy (gene delivery/gene ablation), 

15 research tools, tissue regeneration in vivo and in vitro of all tissues and cell types composing (but 
not limited to) those defined here. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in pancreatic cancer, adenoma, and other cancers, Larsen syndrome, 
tachycardia, erythroderma, night blindness, long QT syndrome, brugada syndrome, heart block, 

20 cell-mediated immunity, and applications as a mediator in inflammation and/or other pathologies 
and disorders. For example, a cDNA encoding the Toll Receptor-like protein may be useful in 
gene therapy, and the Toll Receptor-like protein maybe useful when administered to a subject in 
need thereof. By way of nonlimiting example, the compositions of the present invention will 
have efficacy for treatment of patients suffering from pancreatic cancer, adenoma, and other 

25 cancers, Larsen syndrome, tachycardia, erythroderma, night blindness, long QT syndrome, 
brugada syndrome, heart block, cell-mediated immunity, and applications as a mediator in 
inflammation. The novel nucleic acid encoding Toll Receptor-like protein, and the Toll 
Receptor-like protein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be assessed. 

30 These materials are further useful in the generation of antibodies that bind immunospecifically to 
the novel substances of the invention for use in therapeutic or diagnostic methods. 
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MOL2 

An additional murine GNC2 elFK -like protein of the invention, referred to herein as 
MOL2, is an Olfactory Receptor ("OR")-like protein. The novel nucleic acid of 4989 
nucleotides, (20466828_EXT1, SEQ ID NO:3) encoding a novel GNC2 elFK-like protein is 
5 shown in Table 2 A. 



Table 2A. MOL2 Nucleotide Sequence (SEQ ID NO:3) 

Ttggctgggggcot 

gctacaggccctggaggccatcrracggcgcggacttccaagacctgcggccggacgcttgcggaccggttaaggtca 

aagagccccctgaaatcaatttagttttgtaccctcaaggcctaactggtgaagaagtatatgtaaaagtggatttg 

agg gt t aaatgcccacct acc t atcc agatgt ag ttcctg aaat agagtt aaaaaatgccaaaggtct atcaaatga 

aagtgtcaatttgttaaaatc tcgcct ag aagaactggccaagaaacactgtgggg aggtagtgat gatctt tg aac 

tggcttaccacgtgcagtcatttctcagcgagcat7acaagccccctcccaagtcttttcatgaagaaatgctggaa 

aggcgggctcaagaggagcaacagaggctgttggaggcccaagcggaaagaagagcacagcaacg 

tgagattcagagaaggaaagaagagataaaagaagagaaaaaaaggaaagaaatggctaagcaggaacgtttggaaa 

ttgctagtttgtcaaaccaagatcatacctctaagaaggacccaggaggacacagaacggctgccattctacatgga 

ggctctcctgactttgtaggaaatggtaaacatcgggcaaactcctcaggaaggtctaggttaaggcgagaacgtc^ 

gtattctgtatgtaatagtgaagattctcctggctcttgtgaaattctgtatttcaatatggggagtcctgatcagc 

tcat g g t gcacaaagggaaat gt att g g cagtgatgjtftcaactt ggaaaat t ag t c t acaatgctt tggaaac agcc 

actggtggctttgtcttgttgtatgagtgggtccttcagtggcagaaaaaaatgggtccattc 

aaaag ag aagat t gat aagt g gaaaaagcagatt caagg aacagaaacagaat t caact cact ggt aaaatt gag c c 

atccaaatgtagtacgctaccttgcaatgaattcaaagagcaagacgactccattcgtggtggacattttagtggag 

cacattagtggggtctctcttgctgcacacctgagccactcaggcccc^tccctgtgcatc^ 

agctcagctcctgtcaggccttgattatctgcacagcaattctgtggtgcataaggtcctgagtgcat c taatgtct 

tggt ggatgcagaaggcaccg tcaag att acggact at agcatt t c taagcgcctcgcagacattt gc aagg aggat 

gtgtttgagcaaacccgagttcgttttagtgacaatgctctgccttataaaacggggaagaaaggagatgtttggcg 

tcttggccttctgctgctgtccctcagccaaggacaggaatgtggagagtaccctgtgaccatccctagtgacttac 

cagctgactttcaagattttctaaagaagagatgtgtgtgcttggatgacaaggaaagatggagtccccagcagttg 

ttgat^acacagctttataaatccccagccaaaaatgcctctagtggaacaaagtcctgaatctgaaggacaagatta 

TGTTGAGACTGTTATTCCTAGCAACCGGCTACCCAGTGCTGCCTTCTTTAGTGAGACACAGAGACAGTTTTCCCGAT 
ACTTCATTGAGTTTGAAGAATTACAACTTCTTGGTAAAGGAGCTTTTGGAGCTGTCATC^GGTGCAGAACAAGTTG 
GACGGCTGCTGCTACGCAGTGAAGCGCATCCCCMCAACCCGGC^ 
GACACTGCTGTC&CGGCrGOlCCATGAGAACATTGTGCGCTACTACAA^ 

CGGGACCGGGGACGCCGCCCCCGGACTCCGGGCCCCTGGCCAAGGATGACCGAGCTGCACGCGGGCAGCCGGCGAGC 
GACACAGACGGCCTGGACAGCGTAGAGGCCGCCGCGCCGCCACCCATCCTCAGCAGCTCGGTGGAGTGGAGCACTTC 
GGGCGAGCGCTCGGCCAGTGCCCGTTTCCCCGCCACCGGCCCGGGCTCCAGCGATGACGAGGACGACGACGAGGACG 
AGCIACGGTGGCGTCTTCTCCCAGTCCTTCCTGCCTGCTTCAGATTCTGAAAGTGATATTATCTTTGACAATGAAGAT 
GAGAACAGTAAAAGTCAGAATCAGGATGAAGATTGCAATGAAAAGAATGGCTGCCATGAAACTGAGCCATCAGTGAC 
GACTG AGGCT GTGCACTACCT ATACATCC AG ATGGAGT ACTGT GAGAAGAGCACTTT ACGAGACAGCATTGACCAGG 
GACTGTATCGAGACACCGTCAGACTCTGGAGGCTTTTTCGAGAGATTCTGGATGGATTAGCTTATATCCATGAGAAA 
GGAATGATTCACCG3GATTTGAAGCCTGTCAACATTTTTTTGGM 

TTTGG CGACAGAC CATCT AGCCTTTT CT G CTGACAGCAAACAAGACGATCAGACAG G AGAC TTGATT AAGT CAGAC C 

CTTCAGGTCACTTAACTG^GATGGTTG^CACTGCTCTCTATGTAAGC^ 

TACAACCAGAAAGTGGATCTCTTGAGCCTrGXSGAATTATCTTCT^TGA^ 

AGAAAGGATCTTTGITCTCAACCAACTCAGAGATCCCACTTCGCCTAAGTTTCCAGAAGACTTTG^ 

ATGCAAAGCAGAAATCAG TCAT CTCCTGGCTGTTG AAC C^CGATCCAGCAAAAC^GCCCACAGCCAC^ 

AAGAGTGAGCTGCTGCCXCCACCCGAGATGGAGGAGTCAGJlGCTGCATGAACT 

GGATGGGAAGGK:CTACCGCACCATGATGGCCCAGATCTTCTCGC^GCGCATCTCCCCTG^CATCG^^ 

ACAGCG AC AT ACT GAAGGGCAACT T CTCAATCCGTACAGCCAAGAT GCAGCAGCATGTGTGTGAAACCATCATCCGC 

ATCTT T AAAAG ACATGGTGCTGTT CACTTGTCTACTCC ACT ACTG CTT(XCCG AAACAGAC AAATATATGAGCAC^A 

O^GCTGCCCTATTCATGCTCCACAGCGGCATGCTG^GATGCTTCCTT^ 

ATGTGGCAAGAAATAATATATTGAATTTATW^CGGTACTGCATAGAACGTGTGTTCACGCCGCGCAAGTTAGATCGA 

TTTCATCCCAAAGAACTTCT GGAGTGTGCCTTTGATATTGTCACT TCTACCACCAACAGCTTTCTGCCC ACTGCTGA 

AATTATCTACACTATCTATGAAATC^TCCAAGJVGTTTCXAGCACTTCAAGAAAGAAATTACAGTATTTATTTG^ 

ATACCAT GTT ATTGJtf^GCAATAC TCTTACACTCTG<5GATCCCAG AAGATAAACTCAGT D^CT 

TATGATGCTGTGACAGAGAAGCTGACGAGGAGJVGAAGTGGAAGCTAAA1TTTGTAATCTGTCTTT 

TCTGT GTCGACTCT ACAAGTT T AT TGAACAGAAGGGAGAT TTGCAAGATCTTATGCCAACAATAAATT C ATT AAT AG 

AACAGAAAACAGGTATTGCACAGTTGGTGJ^GTATGGCrTAAAAGACCTAGAG^GGTTGTTGGACTGTTGAAGAAA 

CTTCG<iK^TCAAGTTAGAGGTTTGGGTCTTGATCMTTTGGGCTTGGTTTA 

CTTCCAGTT TGT G GCT AT CAT CAAAC GAAGGCAAAGG GCT GT AC C TGAAATCCTCG CAGC TGGAGGCAG AT AT G ACC 
TGCTGATTCCCCAGITTAGAGGXTCCACAAGCTCT 
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AAGATATCTGCTGCTGTCCTCAACATGGAGGAATCTGTAAGTTCTGTTACAATAGGCTCTGGGGACCTCCTGGTTGT 
AAGTGTGGGCC^GATGTCTATGTCCAGGGCCATAAACCTAACCCAGAAACTCTGGACAGCAGGCATCACAGCAGAAA 
TC ATG T AC G ACTGGTC AC AGT TT C AGTCCCAAGAGGAAT T AC AAG AGT AC TGCAG ACATCATG AAATCACC T ATGTG 
GC CCT TGT CT CGG AT AAAGAAGGAAGCCATGT C AAGG TT AAG TCTTTCGAG AAGGAAAGGCAGAC AGAG AAGCGT GT 
GCTGG AG ACT GAACTT GT GGACCATGT ACT GC AG AAACTG AGGACTAAAGT CACTG ATGAAAGG AATTTT AG AGAAG 
CTTCCGATAATCT^GCAGTGCAAAATCTGAAGGGGTCATTTTCTAATGCTTCAGGTTTGTTTGAAATCCATGGAGCA 
ACAGTGGTTCCCATTGTGAGTGTGCTAGCCCCGGAGAAGCTGTCAGCCAGCACTAGGAGGCGCTATGAAACrCAGGT 
ACAAACTCGACTTCAGACCTCCCTTGCCAACTTACATCAGAAAAGC^GTGAA^ 

T ACCCAAAG AAACAAT ATTACAGTTTTTATCATT AG AGTGGG ATGCTG AT GAAC AGG CATT T AACACAACTGTGAAG 
CAGCT GCTGTC ACGCCTGCC AAAG CAAAGATAC CTCAAAT T AGTCTGT G ATGAAAT TT AT AACATC AAAGTAGAAAA 
AAAGGTGTCTGTGCTATTTCTGTACAGCTATAGAGATGACTACTACAGAATCTTATTTTAA 



An open reading frame (ORF) for MOL2 was identified from nucleotides 1 to 4986. The 
disclosed MOL2 polypeptide (SEQ ID NO:4) encoded by SEQ ID NO:3 is 1662 amino acid 
residues, has a molecular weight of 188250.1 and is presented using the one-letter code in Table 
5 2B. The SignalP, Psort and or Hydropathy profile of MOL2 indicate that this sequence does not 
have a signal peptide and is likely to be localized to the nucleus. Therefore it is likely that 
MOL2 is available at the appropriate sub-cellular localization and hence accessible for the 
therapeutic uses described in this application. 



Table 2B. Encoded MOL2 protein sequence (SEQ ID NO:4). 



MAGGRGAPGRGRDE P PES YPQRQDKELQALE AI YGADFQDLRPDACGPVKVKEP PE IMLVLY PQGLTGEE VYVKVDL 
RVKCPPTYPDWPEIELKNAKGLSNES\^LLKSRLEELAKKHCGEVVMIFEIjAYHVQSFLSEHNKPPPKSFHEEMLE 
RRAQ3EQQRLLE AQAERRAQQRE I LHE IQRRKEE I KEEKKRKEMAKQERLEI ASLSNQDHT S KKDPGGKRTAAILHG 
GSPDFVGNGKHRAHSSGRSRLRRERQYSVCNSE ds PGSCEILYFNMGS PDQLMVHKGKCIGS DSQLGKLVYNALETA 
TGGFVLLYEWLQWQKKMGPFLTSQEKEKIDKCKKQIQ^ 

HrSGVSIiAAHLSHSGPIPVHQLRRYTAQLLSGLDYLHSNSWKKVLSASNVLVDAEGTVKITDYSISKRLADICKED 
VFEQTRVRFS DNALPYKTGKKGDVWRLGLLLLS LSQGQECGE YPVT I PS DLPADFQDFLKKRCVCLDDKERWS POOL 
LKHSFINPQPKMPLVEQS PESEGQDWETVI PSNRLPSAAFFSETQRQFSRYFIEFEELQLLGKGAFGAVIKVQNKL 
DGCCYAVKRIPINPASRQFRRIKGEVTLLSRLHHSNIVRYYKAWIERHERPAGPGTPPPDSGPLAKDDRAARGQPAS 
DTDGLDSVEAAAPPPIIiSSSVEWSTSGERSASARFPATGPGSSDDEDDDEDEHGGVFSQSFLPASDSESDIIFDMED 
ENSKSQNQDEDCNEKNGCHESEPSVTTEAVHYLYIQMEYCEKSTLR^ 

GMIHRDLKPVKIFLDSDDHVKIGDFGLATDHLAFSADSKQDDQTGDLIKSDPSGHLTGMVGTALYVSPEVQGSTKSA 
Y«QKVDLFSLGIIFFEMSYHPMVTASERIFVLNQLRDPTSPKFPEDFDDGEHAKQKSVISWLLNHDPAKRPTATELL 
KSELLPPPQKEESELHE^KTLTKVDGKAYRTMMAQIFSQRISPAIDYTYDSDILKGNFSIRTAKMQQHVCETIIR 
IFKRHGAVQIXrTPLLLPRNRQIYEHNEAALraDHSGMLVMLPFDLRVPFARYVARNWILNLKRYCIERVFTPRKLDR 
FHFKELLECAFDIVTSTTNSFLPTAEI IYTIYE IIQEFPALQERNYSIYLNHTMLLKAILIjHCGIPEDKLSQVYI IL 
YDAVTEKLTRREVEAKFCNLSLSSNSLCRLYKFIEQKGDLQDLMPTINSLIEQKTGIAQLVKYGLKDLEEVVGLLKK 
LGIKLQVWVLINLGLVYKVQ2HNGI IFQFVAIIKRRQRAVPEILAAGGRYDIiLI PQFRGPQALGPVPTAIGVSXAID 
KISAAVLNMEES VS SVT I GSGDLLWSVGQMSMSRAINLTQKLWTAGITAEIMYDWSQFQSQEELQEYCRHHEITYV 
ALVSDKEGSHVKVKSFEKERQTEKRVLETELVDH^ 

TVVPIVSVLAPEKLSASTRRRYETQVQTRLQTSLANLHQKSSEIEILAWDIiPKETILQFLSLEW 

QLLS RL PKGR YIiKL VC PS I YN I KVE KKVS VL FL YS YRDDY YR I L F 



10 The MOL2 nucleic acid sequence has 3 1 19 of 3723 bases (83 %) identical to a Mm 

musculus GCN2 EIF2alpha kinase mRNA (GENBANK-ID: MMU243533|acc:AJ243533 ) 

The full amino acid sequence of the protein of the invention was found to have 479 of 
1662 bases (88 %) amino acid residues (88 %) identical to, and 1554 of 1662 residues (93 %) 
similar to, the 1648 amino acid residue CAB58363 GCN2 EIF2alpha kinase protein from Mus 
15 musculus (ptnr: TREMBLNEW-ACC: CAB58363 ). 

Other BLAST results including the sequences used for ClustalW analysis are presented in 
Table 2C. 
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Table 2C. BLAST results for MOL2 


Gene Indsx/ 
Identifier 


Protein/ 
Organism 


Length 
(aa) 


1 dsn t i t y 
(%) 


Po s i tive s 
. (%) 


Expect 


PatP:B65663 


Protein 
kinase [Homo 
sapiens] 


1649 


1626/1662 
(97%) 


1632/1662 
(90%) 


0.0 


PatP:B43581 


Cancer 
associated 
protein [Homo 
sapiens] 


604 


592/609 
(97%) 


594/609 
(97%) 


6.5e- 
306 


PatP:B42761 


ORF2525 
polypeptide 


619 


585/624 
(93%) 


596/624 
(95%) 


1.3e- 
300 


gi| 107 64165 igb|AAG2 
2591.1 | (AF193344) 


GCN2gamma 
[Mus 

mus cuius] 


1570 


1313/1577 
(83%) 


1375/1577 
(86%) 


0.0 


gi|1136C320|pir| |T4 
5924 


probable 
translation 
initiation 
factor elF- 
2 alpha kinase 
(EC 2.7.1.-) 
[similarity] 
- human 
(fragment) 


938 


867/946 
(93%) 


889/946 
(93%) 


0.0 


gi|7305017|ref |NP 0 
38747.11 


GCN2 

el F2 alpha 
kinase [Mus 
mus cuius] 


1649 


1374/1647 
(83%) 


1442/1647 
(87%) 


0.0 


gi 1 107641 61 | gb | AAG2 
25B9.1I 


GCK2 alpha 
[Mus 

mus cuius] 


1370 


1189/1381 
(86%) 


1241/1381 
(89%) 


0.0 


gi|7243057 Idbj IBAA9 
2576.11 (AB037759) 


KIAA1338 
protein [Homo 
sapiens] 


1495 


1377/1460 
(S4%) 


1380/1460 
(94%) 


O.C 


gi|10764163igb|AAG2 
2590.11 (AF193343) 


GCN2beta [Mus 
mus cuius] 


1648 


1373/1647 
(83%) 


1440/1647 
(87%) 


0.0 


gi | 6065914 ] emb | CABS 
8360.11 (AJ243428) 


putative eIF2 
alpha kinase 
[ Homo 
sapiens] 


546 


505/556 
(90%) 


507/556 
(90%) 


0.0 



This information is presented graphically in the multiple sequence alignment given in 
5 Table 2D (with MOL2 being shown on line 1) as a ClustalW analysis comparing MOL2 with 
related protein sequences. 



Table 2D. Information for the ClustalW proteins: 

1) Novel M0L2 (SEQ ID NO: 4) 
10 2) gi|10764165|gb|AAG22591.1| (AF193344) GCN2gamraa [Mus mus cuius] (SEQ ID 

NO: 35) 

3) gi I 11360320|pir| IT46924 probable translation initiation factor elF- 
2alpha kinase (EC 2.7.1.-) [similarity] - human (fragment) (SEQ ID NO: 36) 

4) gi|7305017|ref|NP_O38747.1| GCN2 eIF2alpha kinase [Mus musculus] (SEQ 
15 ID NO:37) 

5) gi ( 724 3057 | dbj | BAA92576.il (AB037759) KIAA1338 protein [Homo eapiensj 
(SEQ ID NO: 38) 

6) gi|10764163|gb|AAG22590.1| (AF193343) GCN2beta [Mus musculus] (SEQ ID 
NO:39) 
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7 J gi | 6065914 |emb|CAB58360.1 I 
sapiens] (SEQ ID NO: 40) 



(AJ2434 28) putative eIF2 alpha kina3e [Homo 



10 



MOL2 Prote 

AF193344 

T46924 

NP_038747. 

AAC22S89 . 1 

AB037759 

AF193343 

AJ243428 



MDL2 Prote 

ATI 93344 

T46924 

MP 038747. 

AAG225B9.1 

AB037759 

AF193343 

AJ243428 



MDL2 Prote 
AF193344 
T46924 
NP 038747. 
AAG22589 . 1 
ABO 37 759 
ATI 93343 
AJ243428 



MOL2 Prote 

AF193344 

T46924 

NP_038747. 

AAG22 589.1 

AB037759 

AF193343 

AJ243428 



M0L2 Prote 

AF193344 

T46924 

NP_038747. 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



MDL2 Prote 
ATI 93344 
T46924 
HP_038747. 
AAG22569.1 
AB037759 
AF1 93343 
AJ243428 



M0L2 prote 

ATI 9334 4 

T46924 

HP_038747. 

AAG22589.1 

AB037759 

AF193343 



20 30 40 50 60 

....|. ...| ....]....! ....I. ...I ....I. ...I I I ....|.... I 

HAGGRGAPGR GRDEPPESYP QRQDHF.TtQAL EATYGADFQD LRPDACGPVK VKEPPEINLV 
MRTORAL 

MAGGRGASGR GRAEPQESYS QRQDHELQAL EAI YGSDFQD LRPDARG — R VREPPEINLV 



LLL 

HAGGRGASGR GRAEPQESYS ORQDHELQAL EAI YGSDFQD LRPDARG — R VKEPPEINLV 



70 60 90 100 110 120 

I .... I |. ...| ....).... | ....|. ...| ....|. ...| 

LYPQGLTGEE VYVKVDLRVK CPPTYPDVVP EIELKNAKGL SNESVNTOkS RLEELAKKHC 
LVP EIELKNAKGL SNESVNW*kS HLEELAKKQC 



LYPQGLAGEE VYVQVELQVK CPPTYPDVVP EIELKNAKGL SNESVN] 

FFPLGLQ- QPL3 

LYPQGLAGEE VYVQVELQVK CPPTYPDVVP EIELKNAKGL SNESVN] 



HLEELAKKQC 



HLEELAKKQC 



130 140 

I.... I ....I. ...| . 

GEWKIFELA YHVQSFLSEH Nl 
G-EVKIFELA HHVQSFLSEH Nl 



150 160 170 180 

.. I .... I ....I.... | ....I.... I 
^PKSFHE ENLERRAQEE QQRLLEAQAE RRAQQREI] 
SJpKSFHE ENLERQAQEK QQRLLEARRK EEQEQREI] 



G-EVMIFELA HHVQSFLSEH NKggPKSFHE EMLERQAQEK QQRLLEARRK EEQEQREILfl 



G-EVMIFELA HHVQSFLSEH NKgjPKSFHE EMLERQAQEK QQRLLEARRK EEQEQREI! 



200 
.I.... I 

32 




eiqrrkeeik«h:ekkrkemak|qerlei 

ETQRRKEEIKaSEKKRKENAfSQERLEI 



H32S3 T SB' r mRByaSSr|5^ Agj 



EIQRRKEEIKflSEKKRKEMAKl^ERLEI 

siqrrkeeikIeekkrkemakIqerlei 




120 

40 

1 

118 
1 

18 

lie 
i 



180 

99 

1 

177 
1 

23 

177 

1 



240 
159 
1 

237 
1 

83 

237 

1 



300 
217 
1 

295 
17 
141 
295 
1 
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AJ24342B 



MOL2 Prote 
AF193344 
T46924 
NP 038747. 
AAG22589 . 1 
ABO 37 7 59 
API 933 4 3 
AJ243428 



MOL2 Pro to 

API 9334 4 

T46924 

NP_038747. 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



MOL2 Prote 

AF193344 

T46924 

NP_038747 . 

AAG22589.1 

AB0377S9 

AF193343 

AJ243428 



MOL2 Prote 

AF193344 

T46924 

NP 038747. 

AAG22589 .1 

AB037759 

AF193343 

AJ243428 



MOL2 Prote 
AF193344 
T46924 
NP_038747 . 
AAG22589.1 
AB037759 
ATI 933 4 3 
AJ243428 



M0L2 Prote 
API 933 4 4 
T46924 
NP_038747 . 
AAG225B9.1 
AB037759 
AP193343 
AJ243428 



MOL2 Prote 

AF193344 

T46924 

HP 03B747. 

AAG22589.1 

AB037759 

API 93343 

AJ243428 



430 





EGTVKITDYS 
EGTVKITDYS 
EGTVKITDYS 
EGTVKITDYS 




rSDSALPYKT 
PSDSRLPYKT 
PS _ 
PSDSALPYKT 



490 



500 



510 



520 



530 



KKGDVWRL 
KKGDVWRL 1 



4 



E-S52SE rofriiwr 






FFSETQjQFS 
FFSETQ j QFS 

ffsetq'qfs 

FFSET020FS 




610 



620 



630 



640 



650 



LLGKGAFGA 
LLGKGAFGA 



LLGKGAFGAVfllKVQ^KLDG 
LLGRGAFGAVllKV^KLDG 

i^gkgafgavIikvSkldg 
llgkgafgav|ikvc1kldg 




SEVTLLS 
GEVJMJiS 
G3VTLLS 




670 
..I . 



6B0 



690 



700 



710 



tflERHERPAM PGTPPP 
xnERHERPAa PGTPPP 



I I 

!Gg L^kHdrHafB QPASBjDGfflO Wn>&MJ-J;» MsMAM 

:Tg qBqBsp0ty3 ktsgEeeBs BffEFTWWl BBSB; 



480 

396 

1 

474 
196 
321 
474 
1 



540 



540 
455 
- 1 
533 
255 
380 
533 
1 



599 
515 
1 

593 
315 
440 
593 
1 



660 



659 
575 
1 

653 
375 
500 
653 
1 



720 



719 
635 
3 

713 
435 
560 
713 

1 
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1270 



1200 



1290 



1300 



1310 



1320 



MDL2 Prote 

AF193344 

T46924 

KP 038747. 

AAC22589 . 1 

AB037759 

AF193343 

AJ243428 



MOL2 Prote 

AF193344 

T46924 

NPJ>38747 . 

AAG225B9.1 

AB037759 

AF193343 

AJ243428 



MOL2 Prote 

AF193344 

T46924 

NP_038747. 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



KOL2 Prote 

AF193344 

T46924 

NP_038747 . 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



MDL2 Prote 

AF193344 

T46924 

NP_038747 . 

AAG22589.1 

AB037759 

AF193343 

AJ243428 



MDL2 Prote 
AF193344 
T46924 
NP_03B747 . 
AAG22589.1 
AB0377S9 
API 93343 
AJ243428 



MOL2 Prote 
AF1 93344 
T46924 
NP_038747 . 
AAG22589 . 1 
ABO 37 7 59 
AF1 93343 
AJ243428 




1570 
■ I 



1560 



1S90 
• • I 



1600 



1610 



1620 
• 1 




LLSRLPKQRY LKL 

LLSRLPKQRK LKL 

LLSRLPKQRY LKL 

LLSRLPKQRY L.KL 

LLSRLPKQRY LKL 

LLSRLPKQRY LKL 

LLSRLPKQRY LKL 

LLSRLPKQRY LKL 
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Chromosomal information 

MOL2 belongs to genomic DNA [Acc.NO.: AC025168 from GenbaiikNEW]. Within this 
GenbankNew entry was a note showing that the sequence was from Chromosome 15ql4. 
5 Therefore we assign the chromosomal locus of this invention as Chromosome 1 5ql4. 



Tissue expression 

MOL2 is expressed in at least the following tissues: brain and liver (derived from 
literature sources) and thyroid (derived from 20466828_EXT1). 

10 Based on information available on expression of SWISSPROT-ACC:P29089 TYPE-IB 

ANGIOTENSIN U RECEPTOR (AT1B) (AT3) - Rattus nor\>egicus (Rat), the closest G-protein 
coupled receptor family member it is likely that MOL2 is expressed in cardiac tissue, renal 
tissue, and vascular tissue as angiotensin is expressed in these tissues.MOL2 has similarity to the 
murine GNC2 elFK protein, a possible GNC2 elFK and other GNC2 elFKs and their functions 

15 as described in but not limited to the references below: In cukaryotic cells, protein synthesis is 
regulated in response to various environmental stresses by phosphorylating the alpha subunit of 
the eukaryotic initiation factor 2 (eEF2alpha) (Berlanga, et.al., Eur J Biochem; 265(2):754-62; 
Oct, 1999). Three different eEF2alpha kinases have been identified in mammalian cells, the heme- 
regulated inhibitor (HRI), the interferon-inducible RNA-dependent kinase (PKR) and the 

20 endoplasmic reticulum-resident kinase (PERK). A fourth eIF2alpha kinase, termed GCN2, was 
previously characterized from Saccharomyces cerevisiae, Drosophila melanogaster and 
Neurospora crassa. Berlanga et al. 1999 describe the cloning of a mouse GCN2 cDNA 
(MGCN2), which represents the first mammalian GCN2 homolog. MGCN2 has a conserved 
motif, N-terminal to the kinase subdomain V, and a large insert of 139 amino acids located 

25 between subdomains IV and V that are characteristic of the known eIF2alpha kinases. 

Furthermore, MGCN2 contains a class II aminoacyl-tRNA synthetase domain and a degenerate 
kinase segment, downstream and upstream of the eIF2alpha kinase domain, respectively, and 
both are singular features of GCN2 protein kinases. MGCN2 mRNA is expressed as a single 
message of approximately 5.5 kb in a wide range of different tissues, with the highest levels in 

30 the liver and the brain. Specific polyclonal anti-(MGCN2) immunoprecipitated an eDF2alpha 

kinase activity and recognized a 190 kDa phosphoprotein in Western blots from either mouse 

liver or MGCN2-transfected 293 cell extracts. Interestingly, serum starvation increased 

eIF2alpha phosphorylation in MGCN2-transfected human 293T cells. This finding provides 

evidence that GCN2 is the unique eEF2alpha kinase present in all eukaryotes from yeast to 
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mammals and underscores the role of MGCN2 kinase in translational control and its potential 
physiological significance. 

A family of protein kinases regulates translation in response to different cellular stresses 
byphosphorylation of the alpha subunit of eukaryotic initiation factor-2 (eEF-2alpha). In yeast, an 
5 eIF-2aIpha kinase, GCN2, functions in translational control in response to amino acid starvation. 
It is thought that uncharged tRNA that accumulates during amino acid limitation binds to 
sequences in GCN2 homologous to histidyl-tRNA synthetase (HisRS) enzymes, leading to 
enhanced kinase catalytic activity. Given that starvation for amino acids also stimulates 
phosphorylation of eDF-2alpha in mammalian cells, we searched for and identified a GCN2 

10 homologue in mice. Sood etal., 2000 cloned three different cDNAs encoding mouse GCN2 
isoforms, derived from a single gene, that vary in their amino-terminal sequences. Like their 
yeast counterpart, the mouse GCN2 isoforms contain HisRS-related sequences juxtaposed to the 
kinase catalytic domain. While GCN2 mRNA was found in all mouse tissues examined, the 
isoforms appear to be differentially expressed. Mouse GCN2 expressed in yeast was found to 

1 5 inhibit growth by hypcrphosphorylation of eIF-2alpha, requiring both the kinase catalytic 

domain and the HisRS-related sequences. Additionally, lysates prepared from yeast expressing 
mGCN2 were found to phosphorylate recombinant eBF-2alpha substrate. Mouse GCN2 activity 
in both the in vivo and in vitro assays required the presence of serine-51 , the known regulatory 
phosphorylation site in eIF-2alpha. Together, those studies identify a new mammalian eEF- 

20 2alpha kinase, GCN2, that can mediate translational controlJPhosphorylation of the alpha subunit 
of eukaryotic initiation factor 2 (eIF-2alpha) is a well-characterized mechanism regulating ; 
protein synthesis in response to environmental stresses (Y ang et al., Mol Cell Biol; 20(8):2706- 
17; Apr, 2000) . In the yeast Saccharomyces cerevisiae, starvation for amino acids induces 
phosphorylation of eIF-2alpha by Gcn2 protein kinase, leading to elevated translation of GCN4, 

25 a transcriptional activator of more than 50 genes. Uncharged tRNA that accumulates during 
amino acid limitation is proposed to activate Gcn2p by associating with Gcn2p sequences 
homologous to histidyl-tRNA synthetase (HisRS) enzymes. Given that eIF-2alpha 
phosphorylation in mammals is induced in response to both carbohydrate and amino acid 
limitations, we addressed whether activation of Gcn2p in yeast is also controlled by different 

30 nutrient deprivations. It was found that starvation for glucose induces Gcn2p phosphorylation of 
eEF-2alpha and stimulates GCN4 translation. Induction of eIF-2alpha phosphorylation by Gcn2p 
during glucose limitation requires the function of the HisRS-related domain but is largely 
independent of the ribosome binding sequences of Gcn2p. Furthermore, Gcn20p, a factor 
required for Gcn2 protein kinase stimulation of GCN4 expression in response to amino acid 

24 



BNSDOCID: <WO__018l67aA2_L> 



WO 01/81578 



PCT/US01/13578 



starvation, is not essential for GCN4 translational control in response to limitation for 
carbohydrates. These results indicate there are differences between the mechanisms regulating 
Gcn2p activity in response to amino acid and carbohydrate deficiency. Gcn2p induction of 
GCN4 translation during carbohydrate limitation enhances storage of amino acids in the 
5 vacuoles and facilitates entry into exponential growth during a shift from low-glucose to high- 
glucose medium. Gcn2p function also contributes to maintenance of glycogen levels during 
prolonged glucose starvation, suggesting a linkage between amino acid control and glycogen 
metabolism. 



1 0 Uses of the Compositions of the Invention 

The expression pattern, map location and protein similarity information for MOL2 

suggest that it may function like a member of the GCN2 elFaK family. Therefore, the nucleic 
acids and proteins of the invention are useful in potential therapeutic applications implicated, for 
example but not limited to, in various pathologies /disorders as described below: 

15 

• Hyperthyroidism 

• Hypothyroidism 

• Von Hippel-Lindau (VHL) syndrome 

• Alzheimer's disease 
20 • Stroke 

• Tuberous sclerosis 

• Hypercalcemia 

• Parkinson's disease 

• Huntington's disease 
25 • Cerebral palsy 

• Epilepsy 

• Lesch-Nyhan syndrome 

• Multiple sclerosis 

• Ataxia-telangiectasia 
30 • Leukodystrophies 

• Behavioral disorders 

• Addiction 

• Anxiety 

• Pain 
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• Cirrhosis 

• Transplantation 

• and/or other pathologies/disorders. 



5 Potential therapeutic uses for the invention(s) are, for example but not limited to, 

the following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody target 
(therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 

10 tissues and cell types derived from these tissues). 

The nucleic acids and proteins of the invention arc useful in potential therapeutic 
applications implicated in various diseases and disorders described below and/or other 
pathologies and disorders. For example, but not limited to, a cDNA encoding the GCN2 eEFaK 
-like protein may be useful in gene therapy, and the GCN2 elFaK -like protein may be useful 

1 5 when administered to a subject in need thereof. By way of nonlimiting example, the 

compositions of the present invention will have efficacy for treatment of patients suffering from 
Hyperthyroidism, Hypothyroidism, Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, 
Stroke, Tuberous sclerosis, Hypercalcemia, Parkinson's disease, Huntington's disease, Cerebral 
palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia, 

20 Leukodystrophies, Behavioral disorders, Addiction, Anxiety, Pain, Neuroprotection Cirrhosis, 
Transplantation and/or other pathologies/disorders. The novel nucleic acid encoding the GCN2 
elFctK-like protein, and the GCN2 elFaK -like protein of the invention, or fragments thereof, 
may further be useful in diagnostic applications, wherein the presence or amount of the nucleic 
acid or the protein are to be assessed. These materials are further useful in the generation of 

25 antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 



MOL3 

An additional protein of the invention, referred to herein as MOL3, is a human 
30 complement C3-like protein. The novel nucleic acid was identified by TblastN using CuraGen 
Corporation's sequence file for MOL probe or homolog, run against the Genomic Daily Files 
made available by GenBank. The nucleic acid was further predicted by the program GenScan™, 
including selection of exons. These were further modified by means of similarities using 
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BLAST searches. The sequences were then manually corrected for apparent inconsistencies, 
thereby obtaining the sequences encoding the fiill-Icngth protein. The novel nucleic acid of 4894 
nucleotides (82254077.0.1 , SEQ ED NO:5) encoding a novel olfactory receptor-like protein is 
shown in Table 3A. An open reading frame (ORF) was identified beginning with an ATG 
5 initiation codon at nucleotides 1-3 and ending with a TGA codon at nucleotides 4837-4839. 
Putative untranslated regions downstream from the termination codon are underlined in Table 
3 A, and the start and stop codons are in bold letters. 



Table 3A. MOL3 Nucleotide Sequence (SEQ ID NO:5) 

atctccccgtctctccctttctctgccatctccctatgtctctcccatctccctatgcctctcccatctccaagg 

tacatcctggtgaccccccgagttctgagggttggcagtccggagagcattcacattcaggcccactcagactcc 

agacascccctcacaaggaccctcaaggtgaacctc^cagtgtgggacttccccatgaggaagacagtgttggca 

aggagccagctcattctctcaccaggaaacaactttatggaccaggcacctgtgacggttcccgagagcctgatg 

7acctcccaaaaccagggcagcaatatgtgatcatccgggcaacttgggcacccacctcgggctcctcattcatg 

gagaagatggtgctggtggctcttcatgctggctacatctttatccagacggagaagaccatctacaccccttct 

cccctagttcactaccgggtgttcactotgaaccac7vagatggaccctgtgaccaggacattcactctggacatc 

aagaatcctgatgggtccccagcttccagagtccttgtccactcccaagaccaacctggggtgcrtgcactgtct 

tggggccctgaaaagtccctgtgcctcagtttggggacctggaccatcgaagccagctacca^gtac^cccaag 

cagaagttcgaggctgcctttgatgtgaaggaatatgtcctcccatcttttgaggtccagctggtcccaaataag 

acrttcttttacctcaaggatgaggctctgggcgttgacatccaggctcggtatatatttaacaagccagtggac 

ggacatgctttggtcatctttggggtgaaattggactcctgccggatccctatccaaagctccctgcagagggtg 

gaggtgactgaaacagggggtgagatggtgcaagctgagacctcaggggtgaagatcatccagagcccatacaac 

atcaagttcaccaggacaccccagtatttcaagccaggaatgcccttccactttcgggtcagagrcgtacaaagc 

agtcctattcagatcatattccagtctcacctctcacaccaggccactgcaggcttttccttcaccttaccccag 

attccacctcaggtcttcatctcaaatcctgatgggtccccagcttccagagtccttgtccactcccaagaccaa 

/itvagtgtacacctcagctgaggggttggccactctgaccatcaacacagatgcaaatctggacaagctccccatc 

gaggtgw^actgaggaatctcttcagccagaggagcaggcttcagccaagatgacagcttggccttacttgact 

caggatgggtcaggaaacttcctacacatcgaagtaaagacattgggcacagaggttggcagcagcatccagctg 

agcctcaacacaaggcatcaggaccctaaaaccaaggacaagattactcacttcaccatcctggrggtcagggag 

ggtaaggcgcggcagcttgggaggcaggtggcgcaggtgggggtaccctccttccgciattctggccrtttattta 

cttccc ag aggagcaagccaag accctgagtt ggtggct gattcc at atggattgatgtgaatgac agatgcat a 

gggctgaaagttggcttgaagaatgatagattcttccagtctttggagcccaacagccaagtcgaactgaaggtg 

acaggtgatgcagaagccacagtggggctggtggctgtggacaaggctgtctatgtcttgaacagcaaacacaag 

ctcactcagaagaaggta?ggaatgtggtggaggaacatgacattggctgcacaggaggaagtgggaaagacaga 

tttgctgtgttcaaggatgctggattggacctgaaaatcagcacaggaatggatagcggccaccagcaaagtcac 

agctgccagg aggctg ag gt gggagaatc act tg aacccgggaggcagaggttgcggtcagccg agat t gcgcca 

ctgcactccagcgtgaacaagtttaagacagagctggagcaaaagtgctgtgaggctgggctccgggagagccca 

gtggggctgtcttgtgaggagaggacctggcatgtccgccatggtccagcctgtgtggctgctttcctggactgc 

t g ct c ac acctgc t cc ct ccagcggat gaagaag aggacttcgat g ac ct ctt cttggat g acat gcct gtgcgg 

accttgttccccgagagttggctctggaacagt^tctcccattaccccatctctgtga^ 

accacgtggcagtttgtggtggtcagcctcaaggctgga 

aca£ttatgaaatcgttctttgtggaccttaagttgccctcctccgtgatc^ 

gccatgttgtacaatttcagggatcgccaggccaaggtccgagtggag'rtcccccacaaggagacactgtcscagt 

gcgtcaaagccaggagcaccatcccaccaggtagtggtcgtlsccccccacctccrrcca^ 

cttctccctctggagacaggcaaagtggacgtggaggtcaaggctgtgggctac^ 

aagacactcttggtcgaaggttgtg<n^attcaggtcagacccagacaaaactgct 

aacatggtacccg^cacggaggcggaagtgtttatcagtgttcaaggtgacatccttggtgagacaattgt^^ 

agcctgacacccagtgag attc agcagct gctgcgggt ccccacgggctgccctga5cagacgct gagctccctg 

acg^ccgtcatcatcctgtcccgctatttggatacc^ccggccagtggggcaaggtcggg 

caggtgatg aagaatatt ggctac actcagatgctg acccaccggagtt cagacgg c acct acc acacc tccaag 

ggg aaccc aggaagc act tgg ctcacaagctatgtgtt ccgcgtctttgccctggc ct actct atgatgacgacc 

caagtgctitvgcctgtcctctctctgtgacatggccaactggatc^ 

ttcctggagaagggccctgtggtcatgacatccatgtccgaggaggat^ 

gccctgaatgagggaaaggagttgtgcagacagaaggtaggacccaatttgatggcc^gcatcgagaaggccg^ 

ggattgcttgagctcaggaggttgaggctgcagcggagctatgccgtagccatagcctcctatgcactggccgac 

aaaacccactggccagtggatgagcagaatctgggctccctgtacaccattgaggcc^ 

cagaagctggagctgggccggtacaatgagacacacgccatagccaagtggctact^ 

ggaggcttcaggtcc^cccagccaggkiaggagc^gtcgcctttcccac^ 

ntggcccttgaagctctgacccgcttccgcgaagctgtccccttcaagx;gcatccaggatc 

ag agc c cc c aagacagcc ct g aatgtgaattggt acattg atcacagc aatgcct acc aacagcggtcagcaaag 
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TT CCT TGCCC AGG ACG AC CT AG AGAT C AAAGCC AGTGGC AAC GGG AGAGG CACCATCT CGATCCTG ACAATGT AT 
CACAAGTCCCCAGAGTCCCGGGAGGACAACTGCAACCTGTACCACCTGAATGCGACTCTCCACAGTGCCCTAGAA 
GAAAATAAAAAGGGAGCTTGAGACTTTTCGGCTCCGaATGGAAAC^ 
ACTATCATGGAGGTCTCCCTGCTCACGGGCTTCTACCCCAACC^^ 

GAGAGGTACGCCTTTCAGTACAAAACCAAGACAAGTACCAGCGACAGCACTGTTGTCCTCTACCTGGAAAAGCTC 
TCCCATGAGAAGAACACGGTGCTGGGCTTTCGGGTTCACAGGATGCTGCAGGCGGAGTTCCTGCAGGCCGCCCTG 
GTGACCATCTACGACTACTACGAGCCTTCCCGGAGGTGCAGCACTTTCTACAACCT 

CTGAGAAAGATCTGCCACAAAGACATCTGCAGATGTGCAGAGGGACAGTGCCCATCCCTGCAGAAGCCCACT 

C AATTGAGGCAGGAGG AG CT C CAGJ\C AACAGCATGTG AGGCAGGCGT GGATTT T GTGT ACJ^GACAAAGCTGGAA 

TCTGTGGAGGTCTCTGCCTCCAACCCTTACG^CTATTACAACACGCAGCTCGAAGACATCATTAAGAGTGGTACG 

GACCCrGCCAAACCCCTGGCCATGAAGAAATTCGrCTCCCATGCCACT?GCCATGACTCCCTGGGGTTGCAAGAA 

CAGGAATCCTACCTCATCATGGG^CAGACGTCAGACCTGTGGAGAATCAAATCTGATTACAGCTATGTTCTGGGC 

AAGGAGACGTTCCTCATCCTTTG^X^GCAGATGGAGATGCCAGCAAGAAAGAATrGCGGGACCAACTGGAGG^ 

TTTTTGGAATATATGCGCACCCACGGCTGCCAGTCCTC A^ 

CTCTGGGCC ACGTGGGTT T 



The disclosed MOL3 polypeptide (SEQ ID N0:6) encoded by SEQ ID N0:5 is 1612 
amino acid residues, and is presented using the one-letter code in Table 3B. The MOL3 protein 
were analyzed for signal peptide prediction and cellular localization. Signal? results predict that 
5 MOL3 is cleaved between position 20 and 21 of SEQ ID NO:6. Psort and Hydropathy profiles 
also predict that MOL3 contains a signal peptide and is likely to be localized at the endoplasmic 
reticulum (certainty of 0.5500). 



Table 3B. Encoded MOL3 protein sequence (SEQ ID NO:6). 

MSPSLPFSAISLCLSHLPMPLPSPRYILVTPRVIjRVGSPESIHIQAHSDS^ 

ARSQLILSPG£ftJF^QAPVTVPESLMYLPKPGCX2WI IRATWAPTSGSSFME^r\^VALHAGYIFIOTEKTiyT 

PSPLVHYRVFTVNHKMDPVTRTFTLD^ 

TPKQKFEAAFDVKEYVLPSFEVQLVPNKTFFYLKDEAI^ 

LQRVEVTETGGEMVQAETSGIVKI IQSPYNIKFTRTPQYFKPGMPFHFRVRVVQSSPIQI IFQSHLSHQATAGFS 
FTLPQI PPQVFI SWPDGS PASRVLTOSQDQKVYTSAEGLATLTI NTDANLDKLP I E VKTEESLQPBEQASAKMT 
AWP YLTQDGSGNFLH I EVKTLGTEVGSS I QLSUraUJQDPKTKDKITHFTILVVREGKARQIiGRQVAQVOVPSF 
RIIAFYIjLPRGASQDPELVADSIWIDVNDRCIGLKVGLKNDRFFQSLEPNSQVELKVTC 
YVLNSKHKLTQKiCVWNVVEEHDIGCT^ 

RLRS AE I APLHSSVNKFKTEXEQKCCEAGLRE S PVGLSCEERTWHVRHGPACVAAFLDCCSBIIiLPPADEEEDFD 
DLFLDDMP VRTLFP E S WLWNS I SHYP I S VKVPDS I TTKQF VWS LKAGQGGLCVSD P FE LTVMKS FFVDLKLP S 
SVIRNEQVQIQAMLYNFRDRG^KTOVEFPHKETLCS^ 
KAVGYGVQDHVXKTLLVEGCGYSG<^QTKLTORQEF1^^ 
VPrGCPEC/TLSSLTPVIII^RYIiDTTGQWGKVGVEHRDQVMKNIGYTQMLTHRSS 

VFRVFALAYS MMTTQVLS LS SLCDMANW 1 1 IDRQAEDGHFLEKGPVVMTSMSEEDVSIiTAIjVLIALNEGKELCR 
QKVG PNLMAS I E KAGGLLELRRLRI^RS YAVAI ASYALADKTHWP VDE QNI£>SL YT I EATAYGLMQKLELGR YN 
ETHAIAJCWLLEKQE1X^FR5TQPGRSSRI>SHPQRWP0GSUCALE 
VNVrriDHSNAYroRSAKFLAQDDLEIKASG^ 

T FRLRMETRFQNNGEATMTI ME VS LLTGF WNQDDlJCQLTSDVERYAFQYKTKTSTSDSTVVLYliEKLSHE WIT 

VLGFRVHRMLG^FXQAALVTIYDYYEPSRRCSTFYNLPra 

ELQTTACTEAGVDFVYKTKLESVEVSA^PYVYYNTQLEDIIK^ 

LIMGOTSDLWRIKSDYSYVIjG1CETFLILWPADGDA5KKELRI>QLEEFLEYMRTHGCQS 



The full amino acid sequence of the protein of the invention was found to have 257 of 
10 734 amino acids (35 %) identical and 403 of 734 (54%) homolog to a Cavia porcellus (guinea 
pig) complement C3 precursor (contains: C3A anaphylatoxin) (ACC:P12387; 1666 aa), and 255 
of 717 amino acid residues (35 %) identical to, and 401 of 717 residues (55 %) similar to, the 
1663 amino acid residue complement C3 precursor (contains: C3A anaphylatoxin) from Homo 
sapiens (human) (ACC:P01024). 
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The disclosed MOL3 protein (SEQ ID NO:6) also has good identity with a number of 
complement compontent proteins, as shown in Table 3C. 



Table 3C. BLAST results for MOL3 


Gene Index/ 
Identifier 


Protein/ 
Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|4557385|ref |NP 
J)00055.11 


complement 
component 3 
precursor [Homo 
sapiens] 


1663 


633/1750 
(36%) 


938/1750 
(53%) 


0.0 


gi|11869931|gb|AA 
G40565.1IAF154933 
1 (AF154933) 


complement 
component C3 
[Sus scrofa] 


1661 


617/1733 
(35%) 


935/1733 
(53%) 


0.0 


gi|309122|gblAAC4 
2013.11 (K02782) 


preprocomplemen 
t component C3 
[Mus musculus] 


1663 


606/1734 
(34%) 


915/1734 
(51%) 


0.0 



5 This information is presented graphically in the multiple sequence alignment given in 

Table 3D (with MOO being shown on line 1) as a ClustalW analysis comparing MOL3 with 
related protein sequences. 



Table 3D. Information for the ClustalW proteins: 

10 1) Novel MOL3 (SEQ ID NO: 6) 

2) gi | 4 557 38 5 1 ref |NP_000055. 1 1 complement component 3 precursor [Homo sapiens J 
(SEQ ID NO:41) 

3) git 11869931 | gb | AAG40565 . 1 1 AF154933_1 (AF154 933) complement component C3 [Sus 
scrofa] (SEQ ID NO: 42) 

15 4) gi|309122|gbiAAC42013.1| (K02782) preprocomplement component C3 [Mus 

musculus] (SEQ ID NO: 43) 



HDL 3 Prote 
NP_000055. 
AF154933 
K02782 



HDL 3 Prote 
NP_O0O0S5. 
AF154933 
K02782 



M0L3 Prote 
NP_000055. 
ATI 54 933 
K02782 



M0L3 Prote 
MP_OOO0S5. 
ATI 54933 
K02782 




250 



260 



270 



280 
• •I . 



290 

..I • 



300 
..I 
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MOL3 Prote 
NP 000055. 
AF154933 
K02782 



M0L3 Prote 
NP 000055. 
AF154933 
K02782 



M0L3 Prote 
NP_000055. 
AF154933 
K027B2 * 



M0L3 Prote 
MP_000055 . 
AF154933 
K02782 



MDL3 Prote 
NP_000055. 
AF154933 
K02782 



M0L3 Prote 
MP_000055. 
AF154933 
K02782 



M0L3 Prote 
NP 000055. 
AF154933 
K02782 



M013 Prote 
NP_00005S . 
AF154933 
K02782 



MOL3 Prote 
KP_00005S. 
AF154933 
K02782 



I 

M0L3 Prote TV^gIIp] 
MP_000055. 
AF154933 Q5 1 ■ 
K02762 



1210 



1220 



1230 



1240 



1250 



1260 



....I. ...I 

rdfbadySle H.k0p| 




|. ...I I I 1 I I I 

sEng dEthw pvd'eqnlg sf ffi ift 

[USDkB DERFgNjLgS *j 



iLMNK 





1300 1310 

.1 I .... I . 

jQPG RSSRLSHPQR W! 





1350 1360 1370 13B0 

!....( . ... I .... t ....(....( 
[KTALWV NWYgDHS^Y QQ^AKFL Ag pDLEIKgSgS! 
Kl [jHWHjg U^BabWyEM HGgTVlSE£ 

,py rhMl BBEB SJSSEen Ett 11 



1390 



1400 



1410 



1420 



1430 



1440 



1346 
1396 
1394 
1396 




1570 



1580 



1590 



1600 



1610 



1620 
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MOL3 shows significant homologies to human complement C3 proteins, as described in, 
but not limited to, the references below. 
5 It was found that transforming growth factor-betal acts as a potent inhibitor of 

complement C3 biosynthesis in human pancreatic cancer cell lines. Andoh et al. determined 
how transforming growth factor (TGF)-bctal affects complement C3 secretion in the pancreatic 
cancer cell lines PANC-1 and BxPC-3. It is suggested that TGF-betal may act as a potent 
inhibitor of C3 secretion in pancreatic cancer cell lines under inflammatory conditions. This 

10 action of TGF-betal did not correlate with NF-kappaB activation, but associated with the 
translocation of Fos protein into the nucleus. 

The cellular localization of complement C3 and C4 transcripts were analyzed in intestinal 
specimens from patients with Crohn's disease. It has been suggested that the increase in C3 and 
C4 levels in jejunal perfusates of patients with Crohn's disease results from local intestinal 

1 5 synthesis of complement. Laufer et al. suggest that there is local regulated production of 

complement in the intestine of patients with CD, and subsequent complement activation may 
contribute to the inflammatory process. 

The generation of complement C3 and expression of cell membrane complement 
inhibitory proteins by human bronchial epithelium cell line. They found that the interrelationship 

20 between human airway epithelium and complement proteins may affect airway defence, airway 
function, and airway epithelial integrity. Local generation of complement C3 and expression of 
cell membrane CEP by human bronchial epithelium and its modulation by proinflammatory 
cytokines might be an additional regulatory mechanism of local airway defence and may affect 
airway function and epithelial integrity in health and disease. 

25 Janssen et al, Am J Kidney Dis 2000 Jan;35(l):21-8 suggested the activation of the acute 

phase response and complement C3 in patients with IgA nephropathy. The authors have shown 
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ysteraic complement activation in patients with immunoglobulin A (IgA) nephropathy and 
reported that plasma levels of actC3 can indicate disease activity and renal outcome. 

Therapeutic applications 

5 

The expression pattern, and protein similarity information forMOL3 may function as a 
human complement C3-like protein. Therefore, the nucleic acid and protein of the invention are 
useful in potential therapeutic applications implicated, for example but not limited to, cancer, 
lung diseases, including asthma, immundeficiencies, inflammation, Crohn's disease, neurological 
10 disorders, nephropathy, and other diseases and disorders. The homology to antigenic secreted 
and membrane proteins suggests that antibodies directed against the novel genes may be useful 
in treatment and prevention of cancer, lung diseases, including asthma, immundeficiencies, 
inflammation, Crohn's disease, neurological disorders, nephropathy, and other diseases and 
disorders. 

1 5 Potential therapeutic uses for the inventions) are, for example but not limited to, the 

following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody target 
(therapeutic, diagnostic, drug largeting/cytotoxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 

20 tissues and cell types derived from these tissues. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in cancer, lung diseases, including asthma, immundeficiencies, 
inflammation, Crohn's disease, neurological disorders, nephropathy, and other diseases and 
disorders. For example, but not limited to, a cDNA encoding the human complement C3-like 

25 protein may be useful in gene therapy, and the human complement C3-like protein may be useful 
when administered to a subject in need thereof. By way of non-limiting example, the 
compositions of the present invention will have efficacy for treatment of patients suffering from, 
for example, but not limited to, cancer, lung diseases, including asthma, immundeficiencies, 
inflammation, Crohn's disease, neurological disorders, nephropathy, and other diseases and 

30 disorders. The novel nucleic acid encoding the human complement C3-like protein, and the 

human complement C3-like protein of the invention, or fragments thereof, may further be useful 
in diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. These materials are further useful in the generation of antibodies that bind 
immunospccifically to the novel substances of the invention for use in therapeutic or diagnostic 

35 methods. 
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MOL4 

The disclosed Wnt 8-like protein, MOL4 (also referred to herein as AC004826), is 
encoded by a nucleic acid, 1064 nucleotides long (SEQ ID NO:7). An open reading frame was 
5 identified beginning with an ATG initiation codon at nucleotides 4-6 and ending with a TG A 
codon at nucleotides 1057-1059. Putative untranslated regions upstream from the initiation 
codon and downstream from the termination codon are underlined in Table 4A, and the start and 
stop codons are in bold letters. The encoded protein having 351 amino acid residues is presented 
using the one-letter code in Table 4B (SEQ ID NO:8). 

10 



Table 4A. MOL4 Nucleotide Sequence (SEQ ID NO:7). 

GCG ATGGGGAACCTGTTTATGCTCTGGGCAGCTCTGGGCATATGCTGTGCTGCATTCAGTGCCTCTGCCTGGTCR 
GTGAACAATTTCCTGATAACAGGTCCCAAGGCCTATCTGACCTACACGACTAGTGTGGCCTTGGGTGCCCAGAGT 
GGCATCGAGGAGT3CAAGTTCCAGTTTGCTTGGGAACGCTGGAACTGCCCTGAAAATGCTCTTCAGCTCTCCACC 
CACAACAGGCTGAGAAGTGCTACCAGAGAGACTTCCTTCATACATGCTATCAGCTCTGCTGGAGTCATGTACATC 
ATCACC AAG AAC TGTAGC ATGGGTG ACT TCGAAAACTGTGGCTGTGATGGGT CAAACAATGGAA&AAT AGGAGGC 
CATGGCTGG ATCTGGGGAGGCTGCAGCGACAATGTG GAATTTGGGGAAAGGATCTCC AAACT CTTTGT GG ACAGT 
T TGGAG AAGGGG AAGGATGCCAGAGC CCTG ATGAAT CT TCAC AACAACAGGGCCGGC AG ACT GGCAGTGAGAG CC 
ACCATGAAAAGGACATGCAAATGTCATGGCATCTCTGGGAGCTGCAGCATACAGACATGCTGGCTGCAGCTGGCT 
G AATT CCGGG AG ATGGG AGACT AC CT AAAGGCCAAGT ATGACC AG GCGCT G AAAATT GAAAT GG ATAAGCGG C AG 
CTGAGAGCTGGGAACAGCGCCGAGGGCCACTGGGTGCCCGCTGAGGCCrTTCCTTCCTASCGCAGAGGCGGAACTG 
ATCTTTTTAGAGGAATCACCAGATTACTGTACCTGCAATTCCAGCCTGGGCATCTATG3CACAGAGGG?CGTGAG 
TGCCTACAG7WVCAGCCACAACACATCCAGGTGGGAGCGACGTAGCTGTGGGCGCCTGTGCACTGAGTGTGGGCTG 
CAGGTGGAAGAGAGGAAAACTGAGGTCATAAGCAGCTGTAACTGCAAATTCCAGTGGTGCTGTACGGTCAAGTGT 
GACCAGTGTAGGCATGTGGTGAGCAAGTATTACTGCGCACGCTCCCCAGGCAGTGCCCAGTCCCTGGGTAAGGGC 
AGTGCCTGATAATA 



The disclosed nucleic acid MOL4 sequence has 881 of 1050 bases (83%) identical to a 
Mus musculus Wnt 8 mRNA (GENBANK-ID: MMWNT8DPT|acc:Z68889) and 637 of 955 
bases (66%) identical to a Homo sapiens Wnt 8 mRNA (GENBANK-ID: 
15 HSWNT8|acc:YI1094). 

The MOL4 polypeptide (SEQ ID NO:8) encoded by SEQ ID NO:7 is presented using the 
one-letter amino acid code in Table 4B. The Psort profile for MOL4 predicts that this sequence 
has a signal peptide and is likely to be localized outside the cell with a certainty of 0.7700. The 
most likely cleavage site for a MOL4 peptide is between amino acids 24 and 25 based on the 
20 SignalP result. 



Table 4B. MOL4 protein sequence (SEQ ID NO:8) 

MGN L FMLWAALG I CCAAFS AS AWS VNN FL I TG PKA YLT YTT S VALGAQSG IE E C K FQ FAW E RWN C PENALQLS THN 
RI*RSATRETSFIHAISSAGVMYIITKNCSMGDFENCGCIX3SNNGKIGGHGWIWGGCSDNVEFt3ERISKLFVDSLEK 
GKDARALMNLHNNRAGRIiAVRATMKRTCKCHGISGSCSIO/TCWLQIJ^ 

NSAEGHWVPAEAFLPSAEAELIFl^ESPDYCTCNSSI^IYGTEGRECLQNSHNTSRWERRSCGRI^TECGLQ 
KTEVI SSCNCKFQWCCTVKCDQCRH WS KYYCARSPGSAQSLGKGSA 



34 



BNSOOCID: <WO_0181678A2_L> 



WO 01/81578 



PCT/US0 1/1 3578 



The full amino acid sequence of the disclosed MOL4 polypeptide has 282 of 349 amino 
acid residues (80%) identical to, and 306 of 349 residues (87%) positive with, the 354 amino 
acid residue WNT-8D protein from Mus musculus (ptnr:SPTREMBL*ACC: Q64527), (E = 
1.5xl0 16 ) 

5 BLASTP (Non-Redundant Composite database) analysis of the best hits for alignments 

with MOL4 arc listed in Table 4C. 



Table 4C. BLASTP results for MOL4 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 

(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|11693046| 
gbl AAG3B662. 
11 

(AY0O94O2) 


WNTGd precursor [Homo 
sapiens] 


355 


335/348 
(96%) 


336/348 
(96%) 


0.0 


gi|6678169|r 
eflNP 033316 
.11 


stimulated by retinoic 
acid gene 11 [Mus 
musculus] 


354 


271/349 
(77%) 


295/349 
(83%) 


l.Oe- 
148 


gi|104264|pi 
r| IS18771 


de ve lopmental 
regulator Xwnt-8 - 
African clawed frog 


387 


246/335 
(73%) 


285/335 
(84%) 


l.Oe- 
136 


gi|1722B44 |s 
p|P5103O|WND 
C CHICK 


WNT-8C PROTEIN 
PRECURSOR (CWNT-8) 


357 


242/337 
(71%) 


283/337 
(83%) 


l.Oe- 
134 



1 0 This information is presented graphically in the multiple sequence alignment given in 

Table 4D (with MOL4 being shown on line 1) as a ClustalW analysis comparing MOL4 with 
related sequences. 

Table 4D Information for the ClustalW proteins: 



15 1) MOL4 (SEQ ID N0:8) 

2) gijll693046|gb|AAG38662.1| (AY009402) WNT8d precursor [Homo sapiens] (SEQ 
ID NO:44) 

3) gil6678l69|ref |NP_033316.1| stimulated by retinoic acid gene 11 [Mus 
musculus] (SEQ ID NO; 45) 

20 4) gi|104264 ipir| IS18771 developmental regulator Xwnt-8 - African clawed frog 

(SEQ ID NO:46) 

5) gi|1722844|sp|P51030|WN8C_CHICK WNT-6C PROTEIN PRECURSOR (CWNT-8) (SEQ ID 
NO;47) 



25 



30 



35 



10 

.1 . 



MOL4 Prote -^NgaSjffl EgHSgSs Q 

AY009402 -^Nm3j5J BBSS 3 

NP_Q33316. -EhEi53Sv EfJEgY^G 

S18771 KQNTTIiFXLjJ TBLHFSPEgT 

P51030 KBgSTgLjjLS, IV^YGQiLN 



MOL4 Prote 
AY009402 
NP 033316. 





RETSFIH1MSSKGVMY 
RETSnHlAISSAGVMY 
RETS FX H 
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15 



20 



25 



30 



35 



40 



S18771 
P51030 



KOL4 Prote Qjlfe 
AY009402 
NPJ>33316. 
S18771 
P51030 



MOL4 Prote 

AY009402 

NP_033316. 

S18771 

P51030 



MOL.4 Prote 

AY009402 

NP_033316. 

E18771 

P51030 



MOL4 Prote 
AY009402 
NP 033316. 
S18771 
P51030 



MOM Prote 
AY009402 
NP 033316. 
S18771 
P51030 




350 
I 

P SLAKES* 351 

P Sg2 SLWJVWFGW 354 

V ggR PF£Q?KDSAK 354 

ERDSNMLNTK RKN^HQEKT 360 

D-AAVANHTK RRNKKHRE — 357 



370 
..I - 



380 



351 

t 355 

; _ 354 

AFQKNSPTST RASGFVCLQD FFLNLWT 387 
357 



45 WNT genes encode intercellular signaling glycoproteins that play important roles in key 

processes of embryonic development such as mesoderm induction, specification of the 
embryonic axis, and patterning of the central nervous system, spinal cord, and limbs. The name 
WNT denotes the relationship of this family to the Drosophila segment polarity gene 'wingless, 1 
and to its vertebrate ortholog Intl , a mouse protooncogene; see WNT1 . It was noted that multiple 

50 WNT genes are known to exist in several species that have been investigated ranging from 

Drosophila to man. They have been classified into various groups and subgroups on the basis of 
high sequence homology and common expression patterns. The vertebrate WNT8 subfamily 
includes genes from Xenopus, zebrafish, and chicken; The first mammalian WNT8 homolog, a 
human member of the Wnt8 family that they termed WNT8B was characterized on the basis of 

55 the very high sequence similarity (90-91% identity) of the inferred protein to those encoded by 
the Xenopus and zebrafish Wnt8b genes. The human cDNA encodes a 295-amino acid 
polypeptide that contains a C2I12 zinc finger-like motif. A predominant 1.9-kb mRNA was 
detected in a variety of adult and fetal tissues. They used PCR typing of a human 
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monochromosomal hybrid cell panel to map the gene to chromosome 10, and fluorescence in situ 
hybridization for localization at 10q24. 

The full-length cDNA sequence and genomic organization of the human WNT8B gene 
was presented and reported studies of expression of the gene in human and mouse embryos. The 
5 WNT8B gene contains six exons separated by small introns, with the exception of intron 1 . The 
predicted protein has 351 amino acids. The gene is expressed predominantly as a transcript of 
approximately 2. 1 kb. The human and mouse expression patterns appeared to be identical and 
were restricted to the developing brain, with the great majority of expression being found in the 
developing forebrain. In the latter case, expression was confined to the germinative 

10 neuroepithelium of three sharply delimited regions: the dorsomedial wall of the telencephalic 
ventricles (which includes the developing hippocampus), a discrete region of the dorsal 
thalamus, and the mammillary and Tetromammillary regions of the posterior hypothalamus. 
Expression in the developing hippocampus may suggest a role for WNT8B in patterning of this 
region, and subchromosomal localization of the human gene to 10q24 may suggest it as a 

1 5 candidate gene for partial epilepsy (EPT; OMIM-6005 1 2) in families in which the disease has 
been linked to markers in this region. 

WNT1 is a member of a family of cysteine-rich, glycosylated signaling proteins that 
mediate diverse developmental processes such as the control of cell proliferation, adhesion, cell 
polarity, and the establishment of cell fates. Wntl was identified as an oncogene activated by the 

20 insertion of mouse mammary tumor virus in virus-induced mammary adenocarcinomas. 

Although Wntl is not expressed in the normal mammary gland, expression of Wntl in transgenic 
mice causes mammary tumors. To identify downstream genes in the WNT signaling pathway 
that are relevant to the transformed cell phenotype, A PCR-based cDNA subtraction strategy was 
used, suppression subtractive hybridization. It was reported that the identification of two genes, 

25 WISP1 and WISP2, that are upregulated in the mouse mammary epithelial cell line transformed 
by Wntl, but not by Wnt4. Together with a third related gene, WISP3, these proteins define a 
subfamily of the connective tissue growth factor family. Two distinct systems demonstrated 
WISP induction to be associated with the expression of WNTL WISP1 genomic DNA was 
amplified in colon cancer cell lines and in human colon tumors and its RNA overexpressed in 

30 84% of the tumors examined compared with patient-matched normal mucosa. WISP3 also was 
overexpressed in 63% of colon tumors analyzed. In contrast, WISP2 showed reduced RNA 
expression in 79% of the tumors. These results suggested that WISP genes may be downstream 
of WNT1 signaling and that aberrant levels pf WISP expression in colon cancer may play a role 
in colon tumorigenesis. 
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It was found that the W1SP1 cDNA encodes a 367-amino acid protein. Mouse and human 
WISP1 proteins are 84% identical; both have hydrophobic N-terminal signal sequences, 38 
conserved cysteine residues, and 4 potential N-linked glycosylation sites. Alignment of the three 
human WISP proteins showed that WISP1 and WISP3 are most similar (42%), whereas WISP2 
5 had 37% identity with WISP1 and 32% identity with WISP3. 

Uses of the Compositions of the Invention 

The above defined information for this invention suggests that MOL4 may function as a 
member of the "Wnt 8 family". Therefore, the novel nucleic acids and proteins identified here 

10 may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration 

15 in vivo and in vitro of all tissues and cell types composing (but not limited to) those defined here. 
The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in neurodegenerative disorders, epilepsy, cancers including but not 
limited to brain tumor, colon cancer and breast cancer, developmental disorders, neural tube 
defects, and/or other pathologies and disorders. For example, a cDNA encoding the Wnt 8-like 

20 protein may be useful in gene therapy, and the Wnt 8-like protein may be useful when 

administered to a subject in need thereof. By way of nonlimiting example, the compositions of 
the present invention will have efficacy for treatment of patients suffering from 
neurodegenerative disorders, epilepsy, cancers including but not limited to brain tumor, colon 
cancer and breast cancer, developmental disorders, and neural tube defects,. The novel nucleic 

25 acid encoding Wnt 8-like protein, and the Wnt 8-like protein of the invention, or fragments 

thereof, may further be useful in diagnostic applications, wherein the presence or amount of the 
nucleic acid or the protein are to be assessed. These materials are further useful in the generation 
of antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 

30 

MOL5 

The disclosed novel Beta Thymosin-like MOL5 nucleic acid of 21 5 nucleotides (also 
referred to as AC025535) is shown in Table 5A. An ORF begins with an ATG initiation codon at 
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nucleotides 4-7 and ends with a TGA codon at nucleotides 211-213. A putative untranslated 
region upstream from the initiation codon and downstream from the termination codon is 
underlined in Table 5A, and the start and stop codons are in bold letters. 



Table 5A. MOL5 Nucleotide Sequence (SEQ ID NO:9) 

AGTA TGGTCT CAGC CCAG CGTTTCACGAGTCTTCAAG C CTTC&GGCTTTCTTTAATCAAGATG AGTGATAA CCC 
AAACTTGTCAGAAGTGAAGTTTGACAGGTCA^ 

CCAAGGAAACTATC CAGCAGGAG AA AT ACGGTGTTC AAACATCATAT AATGGGGG CTGGGCATGATG 



The MOL protein encoded by SEQ ED NO:9 has 69 amino acid residues and is presented 
using the one-letter code in Table 5B. The Psort profile for MOL5 predicts that this sequence 
has a signal peptide and is likely to be localized at the mitochondrial intermembrane space with a 
certainty of 0.8800. Using the SIGN ALP analysis, the protein of the invention does not appear to 
1 0 contain a predictable signal peptide. 



Table 5B. Encoded MOL5 protein sequence (SEQ ID NO:10) 

MVSAQRFTSLQAFRI^LIKHSDNPKLSEVKFDRSKL 



The disclosed nucleic acid sequence for MOL5 hasl67 of 191 bases (87%) identical to a 
Homo sapiens Beta Thymosin mRNA (GENBANK-ID: D82345|acc:D82345) (E= 5.1e -26 ). 
15 The full MOL5 amino acid sequence has 37 of 45 amino acid residues (82%) identical to, 

and 38 of 45 residues (84%) positive with, the 45 amino acid residue Thymosin beta protein 
from Homo sapiens (ptnr: PIR-ID:JC5274) (E= 1.2e" n ). 

MOL5 also has homology to other proteins as shown in BLAST alignment results in 
Table 5C. 



Table 5C. BLAST results for MOL5 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


ref |NP_068832.1| 


thymosin, beta, 
identified in 
neuroblastoma 
cells [Homo 
sapiens] 


45 


37/45 
(62%) 


38/45 
(84%) 


5e-06 


pir | | 152084 


thymosin beta-4 
precursor - rat 
(fragment) 


56 


27/39 
(69%) 


33/39 
(84%) 


2e-04 


sp 1 P20065 1 TYB4_MOUS 
E 


THYMOSIN BETA-4 


50 


27/39 
(69%) 


33/39 
(84%) 


3e-04 


gb IAAA3674 6.il 
(M92383) 


thymosin beta-10 
[Homo sapiens] 


49 


24/40 
(60%) 


32/40 
(80%) 


0.002 


gb|AAB37101.11 
(025684) 


thymosin beta- 
like protein. 
[Rattus 
norvegicus] 


45 


31/39 
(79%) 


34/39 
(86%) 


0.002 
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This information is presented graphically in the multiple sequence alignment given in 
Table 5D (with MOL5 being shown on line .l) as a ClustalW analysis comparing MOL5 with 
related protein sequences. 

Table 5D Information for the ClustalW proteins: 



10 



15 



1) MOL5 (SEQ ID NO: 10) 

2) ref |NP_068832.1 | thymosin, beta, identified in neuroblastoma cells (Homo 
sapiens] (SEQ ID N0:48) 

3) pir| | 152084 thymosin beta-4 precursor - rat (fragment) (SEQ ID NO: 4 9) 

4) sp|P20065|TYB4_MOUSE THYMOSIN BETA-4 (SEQ ID NO:50) 

5) gb IAAA36746.il (M92383) thymosin bet a- 10 [Homo sapiens) (SEQ ID NO: 51) 

6) gb IAAB37101 . 1 1 (U25684) thymosin beta-like protein [Rattus norvegicus] (SEQ 
ID N0:52) 



10 



20 



M0L5 Prote 

MP_068832. 

X52084 

P20065 

M923B3 

U256B4 



....I.... I I I 

MVSAQRETSL QAFRLELIkJ 

LFA QLAQLLFAt! 

mllfat! 

DCFKKP 



DCFKK^ 




70 

.... |... . 1 

M0L5 Prote VQTSYNGGWA 69 

NP_068832. VQTS 45 

152084 GES 56 

P20065 GES 50 

M92383 EES' 49 

U25584 NQRS 4b 



Thymosin-beta-4 induces the expression of terminal deoxynucleotidyl transferase activity 
20 in vivo and in vitro, inhibits the migration of macrophages, and stimulates fee secretion of 
hypothalamic luteinizing hormone-releasing hormone. It was noted that the protein was 
originally isolated from a partially purified extract of calf thymus, thymosin fraction 5, which 
induced differentiation of T cells and was partially effective in some immunocompromised 
animals. Further studies demonstrated that the molecule is ubiquitous; it had been found in all 
25 tissues and cell lines analyzed. It is found in highest concentrations in spleen, thymus, lung, and 
peritoneal macrophages. It was stated that thymosin-beta-4 is an actin monomer sequestering 
protein that may have a critical role in modulating the dynamics of actin polymerization and 
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depolymerization in nonmuscle cells. Its regulatory role is consistent with the many examples of 
transcriptional regulation of T-beta-4 and of tissue-specific expression. Lymphocytes have a 
unique T-beta-4 transcript relative to trie ubiquitous transcript found in many other tissues and 
cells. It was stated that rat thymosin-beta-4 is synthesized as a 44-amino acid propeptide which is 
5 processed into a 43-amino acid peptide by removal of the first methionyl residue. The molecule 
does not have a signal peptide. Human thymosin-beta-4 has a high degree of homology to rat 
thymosin-beta-4; the coding regions differ by only 9 nucleotides, and these are all silent base 
changes. 

By differential screening of a cDNA library prepared from leukocytes of an acute 
1 0 lymphocytic leukemia patient, a cDNA encoding thymosin-beta-4 was isolated. Using Northern 
blot analysis, the expression of the 830-nucleotide thymosin-beta-4 mRNA in various primary 
myeloid and lymphoid malignant cell lines and in hemopoietic cell lines was studied. It was 
stated that the pattern of thymosin-beta-4 gene expression suggests that it may be involved in an 
early phase of the host defense mechanism. 

15 A cDNA clone for the human interferon-inducible gene 6-26 was isolated and showed 

that its sequence was identical to that for the human thymosin-beta-4 gene. By use of a panel of 
human rodent somatic cell hybrids, it was shown that the 6-26 cDNA recognized seven genes, 
members of a multigene family, present on chromosomes 1, 2, 4, 9, 1 1, 20, and X. These genes 
are symbolized TMSL1 , TMSL2, etc., respectively. Li et al. (1996) established that in the mouse 

20 there is a single Tmsb4 gene and that the lymphoid-specific transcript is generated by extending 
the ubiquitous exon 1 with an alternate downstream splice site. By interspecific backcross 
mapping, they located the mouse gene, which they symbolized Ptmb4, to the distal region of the 
mouse X chromosome, linked to Btk and Gja6. Thus, the human gene could be predicted to 
reside on the X chromosome in the general region of Xq21.3-q22, where BTK is located. By 

25 analysis of somatic cell hybrids, the thymosin-beta-4, or TB4X, gene were mapped to the X 
chromosome. They noted that a homologous gene, TB4Y, is present on the Y chromosome. 

It was stated that prostate carcinoma is the most prevalent form of cancer in males and 
the second leading cause of cancer death among older males. The use of the serum prostate- 
specific antigen test permits early detection of human prostate cancer; however, early detection 
30 has not been accompanied by an improvement in determining which tumors may progress to the 
metastatic stage. The process of tumor metastasis is a multistage event involving local invasion 
and destruction of extracellular matrix; intravasation into blood vessels, lymphatics or other 
channels of transport; survival in the circulation; extravasation out of the vessels into the 
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secondary site; and growth in the new location. Common to many components of the metastatic 
process is the requirement for tumor cell motility. A well-characterized scries of cell lines that 
showed varying metastatic potential was developed from the Dunning rat prostate carcinoma. A 
direct correlation between cell motility and metastatic potential in the Dunning cell lines was 
5 shown. In studies comparing gene expression in poorly and highly motile metastatic cell lines 
derived from Dunning rat prostate carcinoma using differential mRNA display, Bao et al. (1996) 
found a novel member of the thymosin-beta family of actin-binding molecules. The molecule, 
named thymosin-beta- 15 by them, was found to deregulate motility in prostate cells directly. In 
addition, it was expressed in advanced human prostate cancer specimens, but not in normal 

10 human prostate or benign prostatic hyperplasia, suggesting its potential use as a new marker for 
prostate carcinoma progression. Bao et al. (1996) found that thymosin-beta-15 levels correlated 
positively with the Gleason tumor grade. Coffey (1996) pointed out that the upregulation of 
thymosin-beta-15 as a positive motility factor and the down regulation of the motility suppressor 
KAI1 (OMIM- 600623) provide the 'yin and yang' for metastasis; he speculated that these 

15 pathways may provide a new target for therapy. 

Angiogenesis is an essential step in the repair process that occurs after injury. In a study, 
the angiogenic thymic peptide thymosin beta4 (Tbeta4) enhanced wound healing in a rat full 
thickness wound model was examined. Addition of Tbeta4 topically or intraperitoneally 
increased reepithelialization by 42% over saline controls at 4 d and by as much as 61% at 7 d 

20 post-wounding. Treated wounds also contracted at least 1 1% more than controls by day 7. 

Increased collagen deposition and angiogenesis were observed in the treated wounds. We also 
found that Tbeta4 stimulated keratinocyte migration in the Boyden chamber assay. After 4-5 h, 
migration was stimulated 2-3-fold over migration with medium alone when as little as 10 pg of 
Tbeta4 was added to the assay. These results suggest that Tbeta4 is a potent wound healing 

25 factor with multiple activities that may be useful in the clinic. 

Uses of the Compositions of the Invention 

The above defined information for this invention suggests thatMOLS may function as a 
member of a "Beta Thymosin family". Therefore, the novel nucleic acids and proteins identified 
30 here may be useful in potential therapeutic applications implicated in (but not limited to) various 
pathologies and disorders as indicated below. The potential therapeutic applications for this 
invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 
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prognostic marker, gene therapy (gene delivery/gene ablation), research tools, tissue regeneration 
in vi\>o and in vitro of all tissues and cell types composing (but not limited to) those defined here. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in cancer including but not limited to prostate cancer, immunological 
5 and autoimmune disorders {i.e., hyperthyroidism), angiogenesis and wound healing, modulation 
of apoptosis, neurodegenerative and neuropsychiatry disorders, age-related disorders, and other 
pathological disorders involving spleen, thymus, lung, and peritoneal macrophages and/or other 
pathologies and disorders. For example, a cDNA encoding the Beta Thymosin-like protein may 
be useful in gene therapy, and the Beta Thymosin-like protein may be useful when administered 

10 to a subject in need thereof. By way of nonlimiting example, the compositions of the present 
invention will have efficacy for treatment of patients suffering from cancer including but not 
limited to prostate cancer, immunological and autoimmune disorders (Le., hyperthyroidism), 
angiogenesis and wound healing, modulation of apoptosis, neurodegenerative and 
neuropsychiatric disorders, age-related disorders, and other pathological disorders involving 

1 5 spleen, thymus, lung, and peritoneal macrophages. The novel nucleic acid encoding Beta Thymosin- 
like protein, and the Beta Thymosin-like protein of the invention, or fragments thereof, may 
further be useful in diagnostic applications, wherein the presence or amount of the nucleic acid 
or the protein are to be assessed. These materials are further useful in the generation of 
antibodies that bind immuno specifically to the novel substances of the invention for use in 

20 therapeutic or diagnostic methods. 

MOL6 

MOL6a 

The disclosed novel Trypsin-like MOL6a nucleic acid of 730 nucleotides (also referred to 
as GM_87760758_A) is shown in Table 6A. An open reading begins with an ATG initiation 
25 codon at nucleotides 8-10 and ends with a TGA codon at nucleotides 713-715. A putative 
untranslated region upstream from the initiation codon and downstream from the termination 
codon are underlined in Table 6A, and the start and stop codons are in bold letters. 



Table 6A. MOL6a Nucleotide Sequence (SEQ ID NO: 11) 

GRTCACCA TGAARTATGTCTTCTATTTGGGTGTCCTCGCTGGGACATTTTTCTTTGCTGACTCATCTGTTCAGAA 

AGAAGACCCTGCTCCCTATTTGGTGTACCTCAAGTCTCACTTCAACCCCTGTGTGGGCGTCCTCJVTCAAACCCAG 

CTGGGTGCTGGCCCCAGCTCACTGCTATTTACCAAATCTGAAAGTGATGCTGGGAAATITCAAGAGCAGAGTCAG 

AGACGGTACT G AACAGACAAT T AACCCCATTCAGATCGTCCGCT ACTGG AACT ACAGTCATAGCGCCCCACAGGA 

TGACCTCATGCTCATCAAGCTGGCTAAGCCTGCCATGCTCAATCCCAAAGTCCAGCCCCTTCCCCTCGCCACCAC 

CAATGTCAGGCCAGGCACTGTCTGTCTACTCTCAGGTTTGGACTGGAGCCAAGAAAACAGTGGCCGACACCCTGA 

CTTGCGGCAGAACCTGGAGGCCCCCGTGATGTCTGATCGAGAA 

GAATTCCTTATGTGTGAAATTTGTGAAAGTATTCAGCCGAA^ 
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CAT^GACAAGCTCCAGGGAATCGAGGTGGGGCACTTCATGGG^ 

ATATGTATCCTGGATTGAGAACACTGCTAAGGACAAGTGAGACCCrACTTCTCCC 



The disclosed nucleic acid sequence has 354 of 581 bases (60%) identical to aMus 
miisctilus prepro-Trypsininogen mRNA (GENBANK-ID: MMTRYAR|acc:X04574) (E value = 
9.9e- 24 ). 

5 The MOL6a protein encoded by SEQ ED NO: 1 1 has 235 amino acid residues, and is 

presented using the one-letter code in Table 6B (SEQ ID NO:12). The Psort profile for MOL6a 
predicts that this sequence has a signal peptide and is likely to be localized on the outside with a 
certainty of 0.3700. The most likely cleavage site for a peptide is between amino acids 1 9 and 
20, ADS-SV based on the SignalP result. 

10 



Table 6B. Encoded MOL6a protein sequence (SEQ ID NO:12). 

MKWFYLGVIAGTFFFADSSVQKEDPAPYTiVYLKSHFNPCVG\^ 
EQTINPIQIWYWNYSHSAPQDDLMLIKIAKFAMLN 
LEAPVMSDRECQKTEQGKSHRNSLCVKFVKVFSRIFGEVA 
ENTAKDK 



The full amino acid sequence of MOL6a was found to have 79 of 208 amino acid 
residues (37%) identical to, and 1 18 of 208 residues (56%) positive with, the 248 amino acid 
residue TRIPSINOGEN I-Pl PRECURSOR (EC 3.4.21.4) protein from Gallus gallus (ptnr: 
15 SWISSNEW--ACC: Q90627) (E value = Lie- 33 ). 

MOL6 also has high homology to the proteins shown in the BLAST data in Table 6C. 



SNP analysis of MOL6a is described in Example 2. 



Table 6C. BLAST results for MOL6a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length ; 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|2499B62|sp|Q9062 
7|TRY1 CHICK 


TRYPSIN I -PI 
PRECURSOR 


248 


79/208 
(37%) 


116/208 
(55%) 


2e-31 


gil 2118087 Ipirl |S55 


trypsin (EC 
3.4.21.4) I 
precursor, 
pancreatic - 
chicken 


248 


78/20B 
(37%) 


117/208 
(55%) 


5e-31 


gi |6678439|ref INP 0 
33456.11 


trypsin 2 [Mus 
musculusj 


246 


77/215 
(35%) 


115/215 
(52%) 


8e-29 


gi!1633123|pdb|lSLW 
IB 


Chain B> Rat 
Anionic Nl43h, 
E151h Trypsin 
Complexed To AB6h 
Ecotin 


223 


74/212 
(34%) 


116/212 
(53%) 


le-28 



MOL6b 

20 In the present invention, the target sequence identified previously, MOL6a, Accession 

Number GM_87760758_A, was subjected to the exon linking process to confirm the sequence. 
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PCR primers were designed by starting at the most upstream sequence available, for the forward 
primer, and at the most downstream sequence available for the reverse primer. In each case, the 
sequence was examined, walking inward from the respective termini toward the coding 
sequence, until a suitable sequence that is either unique or highly selective was encountered, or, 
5 in the case of the reverse primer, until the stop codon was reached. Such suitable sequences 
were then employed as the forward and reverse primers in a PCR amplification based on library 
containing a wide range of cDNA species. The resulting amplicon was gel purified, cloned and 
sequenced to high redundancy to provide the sequence reported below, which is designated 
MOL6b, Accession Number GM_87760758_A_da 
10 The disclosed novel Trypsin-like MOL6b nucleic acid of 730 nucleotides (also referred 

to as GM_87760758_A_da) is shown in Table 6D. An open reading frame begins with an ATG 
initiation codon at nucleotides 8-10 and ends with a TGA codon at nucleotides 713-715. A 
putative untranslated region upstream from the initiation codon and downstream from the 
termination codon are underlined in Table 6A, and the start and stop codons are in bold letters. 

15 



Table 6D. MOL6b Nucleotide Sequence (SEQ ID NO: 13) 

GATCACCAT&AAA.TATGTCTTCTATTTC 

AAGAAGACCCTGCTCCCTATTTGGTGTACCTCAAGTCTCACTTCAACCCC^ 

AG CTGGGTGCTGGCC CCAQ CTC ACTG ATTTACCAAAT CTGAAAGTGATGCTGGGAAATTT CAAGAGCAGAGT 

CAGAGACXSGTACTGAACAGACAATTAACCCCATTCA^ 

AGGATGACCTCATGCTCATCAAGCTGGCTAAGCCTGC 

ACCACCAATGTC^GCCCAGGCACTGTCTGTCTA 

CCCTGACTTG CGGCAG AAC CTGGAGGCCCCCGTGATGT CTG ATCGAGAATG C CAAAAAACAGAACAAGGAAAAA 
GCCACAGGAATTCCT1ATGTGTGAAATTTGTGAAAGTATT CAGCCGAATTTTTGGGGAGGTGG CCGTTGCTACT 
GTCAT CTG CAAAGACAAGCTCCAGG GAATCGAGGTGGGG CACTTCATGGGAGGGGAOGTCGG CATCTACACCAA 
TGTTT ACAAAT ATGTAT C CTGG ATTG AG AACACTG CT AAGGACAAGTQ AG AC CCT ACTT CTC C C 

The MOL6b protein encoded by SEQ ID NO:13 has 235 amino acid residues, and is 
20 presented using the one-letter code in Table 6E (SEQ ID NO: 14). The Psort profile for MOL6a 
predicts that this sequence has a signal peptide and is likely to be localized on (he outside with a 
certainty of 0.3700. The most likely cleavage site for a peptide is between amino acids 19 and 
20 based on the SignalP result. 



Table 6E- Encoded MOL6b protein sequence (SEQ ID NO:14). 

MKYVFYLGVIAGTFFFADSSVQKEDPAPYLVXLKSHFNPCVGVLIK^ 

EG/TINPIQIVRYV^YSHSAPQDDUGiIKI^ 

LEAPVMSDRECQKTEC£KSHRNSI£^KFVKVFSRIFGEVA^ 

EHTAKDK • 

45 



oiei57eA2 i > 



WO 01/81578 



PCT/US01/13578 



The full amino acid sequence of MOL6b was found to have homology with several 
proteins including those disclosed in the BLASTP data in Table 6F. 



Table 6F. BLAST results for MOL6b 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
<%) 


Expect 


ACC:Q90627 


TRYPSIN I-Pl 
PRECURSOR (EC 
3.4.21.4) - 
Gallus gallus 
(Chicken) 


246 


79/208 
(37%) 


110/208 
(56%) 


3.8e- 
34 


PIR-ID:S55067 


trypsin (EC 
3.4.21.4) I 
precursor, 
pancreatic - 
chicken 


248 


78/208 
(37%) 


117/208 
(55%) 


7.9e- 
34 


ACC:Q90628 


TRYPSIN I-P38 
PRRCURSOR (EC 
3.4.21.4) - 
Gallus gallus 
(Chicken) 


248 


78/208 
(37%) 


117/208 
(56%) 


l.Oe- 
33 


ACC:P07477 


TRYPSIN I 
PRECURSOR (EC 
3.4.21.4) 
(CATIONIC 
TRYPSINOGEN) - 
Homo sapiens 
(Human) 


247 


76/212 
(35%) 


112/212 
(52%) 


1.3e- 
31 



5 

MOL6b also has high homology to the proteins shown in the BLASTX alignment data in 
Table 6G. 



Table 6G. BLASTX results for MOL6b 






Smallest 






Sum 


•Reading 


High 


Prob 


Sequences producing High- scoring Segment Pairs: ' Frame 


Score 


P(N) N 


ptnr:SWISSPROT-ACC:Q90627 TRYPSIN I-Pl PRECURSOR (EC 3. +2 


372 


2.3e-33 1 


ptnr :PIR-ID:S55067 trypsin (EC 3.4.21.4) I precursor, .+2 


369 


4.7e-33 1 


ptnr:SWISSPROT-ACC:Q90628 TRYPSIN I-P38 PRECURSOR (EC .+2 


368 


6.0e-33 1 


ptnr:SWISSPROT-ACC:P0714 6 TRYPSIN II, ANIONIC PRECURSO. +2 


350 


4.9e-31 1 


ptnr : SWISS PROT-ACC : P07477 TRYPSIN I PRECURSOR (EC 3. 4.. +2 


348 


7.9e-31 1 


ptnr:SPTREM3L-ACC:Q9R0T7 PANCREATIC TRYPSIN - Mus muse. +2 


348 


7.9e-31 1 


ptnr:SWISSPROT-ACC:P15951 TRYPSIN III PRECURSOR (EC 3.. +2 


347 


1.0e-30 1 



This information is presented graphically in the multiple sequence alignment given in 
10 Table 6H (with MOL6a being shown on line 1 and MOL6b being shown on line 2) as a 
ClustalW analysis comparing MOL6 with related protein sequences. 

Table 6H Information for the ClustalW proteins: 



46 



BNSDCCID: <WO 0181678A2_L> 



WO 01/81578 



PCT/IJ SO 1/13578 



1) MOb6a (SEQ ID NO: 12) 

2) M0L6b (SEQ ID NO: 14) 

3) gi I 249986? I sp | Q90627 |TRY1_CHICK TRYPSIN I-Pl PRECURSOR (SEQ ID NO: 53) 

4) gi 1 213.8087 |p.ir | IS55067 trypsin (EC 3.4.21.4) I precursor, pancreatic - 
5 chicken (SEQ ID NO: 54) 

5) gi|6678439|ref |NP_0334S6 . 1 | trypsin 2 [Mus musculus] (SEQ ID NO:55) 

6) gi|1633123|pdb|lSLW|B Chair. B, Rat Anionic N143h, E151h Trypsin Complexed To 
A86h Ecotin (SEQ ID NO: 56) 



MOL6a Prot 
M0L6b Prot 
090627 
S55067 
KP 033456. 
1633123 



MDL6a Prot 
M0L6b Prot 
090627 
S55067 
NP 033456. 



_j 1 ... I ....I.... I 

SyvfylgvE agtffSad|- 

SYVFYiLGVL AGTFFlADj 

3FLVLVAFV GVTVA 3PI * 

gFLVLVAFL GVAVA3?l| 

_SALIil![iAI/V GAAVAgPVD- 



I ....I- 
S VQK0Dtl 
S VQKgDPp 
D EDDDKIVGGY SCARSAE 
0 EDDDKIVGGY SCARS 4 
DDDKIVGGY TCR 




MDL6a Prot 

MOLSb Prot 

Q90627 

S55067 

KP 033456. 

1633123 



190 200 210 
|.. ..| ...,|....| ....|....| . 
SHRNSlgVKF VKVFSRIFGE VAVAlffilSSD Kg 
SHRNSlaVKF VKVFSRIFGE VAVAlfilggD Kfi 
0 IGY gNGGKDSCQG DSGgPffvgNG of 
flSGGKDSCQG DSC-gpBvgNG Qf 
53GGKDSCQG DSGgpBvgWG e| 
KFVTAY gS DAgMLRYKS KgP 



I . 



ITSNMJ 
ITSKMI 2IGY 
ITNNMlgVGF 



220 
■ I 



1e — y PhfmgSdi 
3 E — V §hfmg|di 

jVSWGJ tCAQKP'/P 
t-VSWGI §CAQK£YP 
JVSKGY eCAQPDA? 
-J WYTPDNV1 
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MOL6b also contained several single polynucleotide polymorphisms described in Table 

61. 



Table 61. SNP for MOL6b 


Position 


Nucleotide Change 


Number of Occurrences 


70 


OA 


2 


70 


OG 


6 


■261 


T>C 


2 


406 


A>C 


2 



47 



BNSOOCIO: <WO 01BIS78A2_I_> 



WO 01/81578 



PCT/US01/13578 



573 


G>T 


2 


585 


OT 


2 


737 


A>G 


4 



Trypsin (EC 3.4.21.4 ), like elastase, is a member of the pancreatic family of serine 
proteases. The gene encoding trypsin- 1 (TRY1) is also referred to as serine protease- 1 (PRSS1). 
MacDonald et al. (1982) reported nucleotide sequences of cDNAs representing 2 pancreatic rat 
5 trypsinogens. Using a rat cDNA probe, Honey ct al. (1984, 1984) found that a 3.8-kb DNA 
fragment containing human trypsin-1 gene sequences cosegregated with chromosome 7, and 
assigned the gene further to 7q22-7qter by study of hybrids with a deletion of this segment. The 
trypsin gene is on mouse chromosome 6 (Honey et al., 1984). Carboxypeptidase A and trypsin 
are a syntenic pair conserved in mouse and man. Emi et al. (1986) isolated cDNA clones for two 

10 major human trypsinogen isozymes from a pancreatic cDNA library. The deduced amino acid 
sequences had 89% homology and the same number of amino acids (247), including a 15-amino 
acid signal peptide and an 8-amino acid activation peptide. Southern blot analysis of human 
genomic DNA with the cloned cDNA as a probe showed that the human trypsinogen genes 
constitute a family of more than ten, some of which may be pseudogencs or may be expressed in 

1 5 other stages of development. 

Rowen et al. (1996) found that there are eight trypsinogen genes embedded in the beta T 
cell receptor locus or cluster of genes (TCRB; OMIM- 186930), which maps to 7q35. In the 685- 
kb DNA segment that they sequenced they found five tandemly an-ayed 10-kb locus-specific 
repeats (homology units) at the 3-prime end of the locus. These repeats exhibited 90 to 91% 

20 overall nucleotide similarity, and embedded within each is a trypsinogen gene. Alignment of 

pancreatic trypsinogen cDNAs with the gerrnline sequences showed that these trypsinogen genes 
contain five exons that span approximately 3.6 kb. Further analyses revealed 2 trypsinogen 
pseudogenes and one relic trypsinogen gene at the 5-prime end of the sequence, all in inverted 
transcriptional orientation. They denoted eight trypsinogen genes Tl through T8 from 5-prime to 

25 3-prime, Rowen et al. (1 996) found that only two of three pancreatically expressed trypsinogen 
cDNAs correspond to trypsinogen genes in the TCRB locus; T4 was denoted trypsinogen 1 and 
T8 was denoted trypsinogen 2 (OMIM- 601564). The third pancreatic cDNA, identified 
independently as trypsinogen 3 (Tani et al., 1990) and 4 (Wiegand et al. f 1993), is distinct from 
the third apparently functional trypsinogen gene (T6) in the TCRB locus but related to the other 

30 pancreatic trypsinogens. Rowen et al. (1996) stated that the T6 gene is deleted in a common 

insertion-deletion polymorphism; if it is functional, its function is apparently not essential. Some 

48 



BNSDOCID: <WO 0181678A2_I_> 



wo 



01/81578 



PCT/USO 1/1 3578 



of the trypsinogen genes are expressed in nonpancreatic tissues where their function is unknown. 
Rowen et al. (1996) noted thai the intercalation of the trypsinogen genes in the TCRB locus is 
conserved in mouse and chicken, suggesting shared functional or regulatory constraints, as has 
been postulated for genes in the major histocompatibility complex (such as class 1, II, and HI 
5 genes) that share similar long-term organizational relationships. 

Rowen et al. (1 996) mapped the gene corresponding to the third pancreatic trypsinogen 
cDNA by fluorescence in situ hybridization. They used a cosmid clone containing 3 trypsinogen 
genes. Strong hybridization to chromosome 7 and weaker hybridization to chromosome 9 were 
observed. They isolated and partially sequenced 4 cosmid clones from the chromosome 9 region. 

10 They found that the region represents a duplication and translocation of a DNA segment from the 
3-prime end of the TCRB locus that includes at least seven V(beta) elements and a functional 
trypsinogen gene denoted T9. The assignment of the PRSS1 gene to 7q35 is established by the 
demonstration of its sequence within the sequence of the 'locus' (OMIM- 186930) for the T-cell 
receptor beta-chain (Rowen et al., 1996). It is further supported by the linkage between 

15 microsatellite markers in the 7q35 region and hereditary pancreatitis (OMIM- 167800) and the 
demonstration of mutations in the PRSS1 gene in hereditary pancreatitis. 

Whitcomb et al. (1996) stated that the high degree of DNA sequence homology (more 
than 91%) present among this cluster of five trypsinogen genes identified by Rowen et al. (1 996) 
demanded that highly specific sequence analysis strategies be developed for mutational 

20 screening in families with hereditary pancreatitis. This was necessary to ensure that each 

sequencing run contained only the two alleles corresponding to a single gene, thereby permitting 
detection of heterozygotes in this autosomal dominant disorder, and not a dozen or more alleles 
from multiple related trypsinogen-like genes, which would make detection of heterozygotes 
nearly impossible. In a family with hereditary pancreatitis, Whitcomb et al. (1996) found that 

25 affected individuals had a single G-to-A transition mutation in the third exon of cationic 
trypsinogen (276000.0001). This mutation was predicted to result in an argl05-to-his 
substitution in the trypsin gene (residue number 1 17 in the more common chyrnotrypsin number 
system). Subsequently, the same mutation was found in a total of five different hereditary 
pancreatitis kindreds (four from the U.S. and one from Italy) containing a total of 20 affected 

30 individuals and six obligate carriers. The mutation was found in none of the obligate unaffected 
members (individuals who married into the family). Subsequent haplotyping revealed that all 
four of the American families displayed the same high risk haplotype over a 4-cM region 
encompassing seven STR markers, confirming the likelihood that these kindreds shared a 
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common ancestor, although no link could be found through eight generations. A fifth family 
from Italy displayed a unique haplotypc indicating that the same mutation had occurred on at 
least 2 occasions. The G-to-A mutation at codon 1 17 created a novel enzyme recognition site for 
Afllll which provided a facile means to screen for the mutation. As with the obligate unaffected 
5 members of the pancreatitis kindreds, none of 140 controls possessed the G-to-A mutation as 
assayed by the lack of Afllll digestion of the amplified exonic DNA. 

Failure to thrive, nutritional edema, and hypoproteinemia with normal sweat electrolytes 
were features of 2 affected male infants reported by Townes (1965) and Townes et al. (1967). A 
protein hydrolysate diet was beneficial. A male sib of the first patient reported by Townes (1 965) 
10 had died, apparently of the same condition. Morris and Fisher (1967) reported an affected female 
who also had imperforate anus. The clinical picture in enterokinase deficiency (OMIM- 226200) 
is closely similar; however, the defect is not in the synthesis of trypsinogen but in the synthesis 
of the enterokinase which activates proteolytic enzymes produced by the pancreas. Oral 
pancreatin represents a therapeutically successful form of enzyme replacement (Townes, 1972). 

1 5 Since hereditary pancreatitis has been mapped rather precisely to 7q35 and since a defect in the 
trypsinogen gene has been identified in hereditary pancreatitis, the assignment of the trypsinogen 
gene can be refined from 7q32-qter to 7q35. 

Ferec et al. (1999) studied 14 families with hereditary pancreatitis and found mutations in 
the PRSS1 gene in 8 families. In 4 of these families, the mutation (R117H; 276000.0001) had 
20 been described by Whitcomb et al. (1996). Three novel mutations were described in 4 other 
families. 

Sahin-Toth et al. (1999) studied the roles of the Rl 17H and N21I (276000.0002) 
mutations in hereditary pancreatitis. They stated that the Rl 17H mutation is believed to cause 
pancreatitis by eliminating an essential autotytic cleavage site in trypsin, thereby rendering the 

25 protease resistant to inactivation through autolysis. Sahin-Toth et al. (1999) demonstrated that 
the Rl 1 7H mutation also significantly inhibited autocatalytic trypsinogen breakdown under 
Ca(2+)-free conditions and stabilized the zymogen form of rat trypsin. Taken together with 
findings demonstrating that the N21I mutation stabilized rat trypsinogen against autoactivation 
and consequent autocatalytic degradation, the observations suggested a unifying molecular 

30 pathomechanism for hereditary pancreatitis in which zymogen stabilization plays a central role. 

Uses of the Compositions of the Invention 
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The above defined information for this invention suggests that this Trypsin-like protein 
may function as a member of a "Trypsin family*'. Therefore, the novel nucleic acids and proteins 
identified here may be useful in potential therapeutic applications implicated in (but not limited 
to) various pathologies and disorders as indicated below. The potential therapeutic applications 
5 for this invention include, but are not limited to: protein therapeutic, small molecule drug target, 
antibody target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or 
prognostic marker, gene therapy (gene delivery/gene ablation), research took, tissue regeneration 
in vivo and in vitro of all tissues and cell types composing (but not limited to) those defined here. 
The nucleic acids and proteins of the invention are useful in potential therapeutic 

10 applications implicated in failure to thrive, nutritional edema, and hypoprotememia, tripsinogen 
deficiency disease, chronic and heriditary pancreatitis, enterkinase defieciency, cancer and/or 
related pathologies and disorders and/or other pathologies and disorders. For example, a cDNA 
encoding the Trypsin-like protein may be useful in gene therapy, and the Trypsin-like protein 
may be useful when adrninistered to a subject in need thereof. By way of nonlimiting example, 

15 the compositions of the present invention will have efficacy for treatment of patients suffering 
from failure to thrive, nutritional edema, and hypoproteinemia, trypsinogen deficiency disease, 
chronic and heriditary pancreatitis, enterkinase defieciency, cancer. The novel nucleic acid 
encoding Trypsin-like protein, and the Trypsin-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of the 

20 nucleic acid or the protein are to be assessed. These materials are further useful in the generation 
of antibodies that bind immunospecifically to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 



MOL7 

25 A novel nucleic acid encoding a Kallikrein-like-protein MOL7 was identified by TblasfN 

using CuraGen Corporation's sequence file for MOL7 probes or homologs, and run against the 
Genomic Daily Files made available by GenBank. The nucleic acid was further predicted by the 
program GenScan™, including selection of exons. These were further modified by means of 
similarities using BIAST searches. The sequences were then manually corrected for apparent 

30 inconsistencies, thereby obtaining the sequences encoding the full-length protein. The disclosed 
novel MOL7 nucleic acid of 1 81 1 nucleotides (also referred to as 30675745.0.499) is shown in 
Table 7 A. An open reading frame begins with an ATG initiation codon at nucleotides 368-370 
and ends with a TAG codon at nucleotides 1553-1 555. A putative untranslated region upstream 
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from the initiation codon and downstream from the termination codon are underlined in Table 
7A, and the start and stop codons are in bold letters. 



Table 7A. MOL7 Nucleotide Sequence (SEQ ID NO:15) 

ACAAATCCTTCTGTTGAACTCTACTGTGTCAGGCCRGCCTGAGTTCATTTCTCCTTGAGCRGGAACAGTT 
CATGGACGAACTCTGAGGACCATTCTGAGGACAAGAGGCATCCAGTGTCATGAGTGGAACATGCAGCATT 
T?ATGGCTACAGAGTTAAGGCAAGGGTTGAATTCCACGAGTCARAAAGCAGCCCTTTTCAGAGACCCAAC 
TCTCTGGGGTGCTCAGGGGCTTGGGCTGGATTGAGAAGAAAACTGACAAGAGTAAGCTGCCCTCTCTTCT 
C?GGCCATCTCACAAACCACAGTGCGGGCCAACTGGTCCTGCCTCTTTACCACACAGAACCAAGCACTAG 
GGATAAGACAGCTGCCCA TGGTGTCCGCGGCGGGTCTCTCTGGGGATGGCAAGATGCGAGGGGTGCTCCr 
GGTGCTGCTCGGCCTTCTCTATTCTTCCACCAGTTGTGGCGTCCAGAAAGCTTCCGTTTTCTACGGTCCT 
GACCCC/vAGGAGGGCTTGGTCAGCAGCATGGAGTTCCCGTGGGTGGTGTCGCTGCAGGACTCCCAGTACA 
CACACCTGGCTTTCGGCTGCATCCTGAGCGAGTTCTGGGTCCTCAGCATCGCATCCGCCATTCAGAACAG 
GAAGGACATTGTCGTTATAGTGGGTATAAGTAACATGGATCCTAGCAAGA'l'TGCTCACACAGAGTATCCA 
GTCAATACCATCATCATCCATGAGGACTTTGATAACAACTCCATGAGCAACAACATAGCCCTCCTGAAGA 
CAGACACAGCGATGCATTTTGGCAACCTGGTCCAGTCCATCTGCTTCCTCGGCAGAATGCTGCATACACC 
ACCAGTCTTGCAGAACTGCTGGGTGTCAGGATGGAATCCCACATCTGCAACAGGAAATCACATGACGATG 
AGTGTCC7GAGGAAAATCTTCGTGAAAGATCTTGACATGTGTCCCCTATACAAACTCCAGAAGACAGAAT 
GCGGCAGCCACACGAAAGAGGAAACCAAGACTGCCTGCTTGGGGGACCCAGGAAGCCCAATGATGTGCCA 
GCTACAGCAGTTCGATCTGTGGGTTCTGAGAGGAATCCTGAACTTCGGTGGTGAGACGTGCCCTGGCCTG 
TTTCTGTACACCAAGSTGGAAGACTACAGCAAATGGATCACATCCAAGGCTGAGAGGGCCGGCCCTCCCC 
TGTCCTCACTCCACCACTGGGAAAAGTTGATTTCTTTCTCCCACCATGGACCAAATGCCGCCATGACACA 
GAAGACATAl'TCTGATTCTGAACTGGGCCATGTTGGATCATACTTGCASGGACAAAGAAGGACCATCACG 
CATTCACGACTAGG7VAACAGCTCTAGAGATAGTCTAGATGTTAGGGAGAAGGATGTAAAGGAATCAGGCA 
GGTCTCCTGAGGCGTCTGTACAACCCTTATACTATGACTATTACGGTGGGGAGGTGGGGGAAGGTAGGAT 
TTTTGCAGGTCAGAACAGGTTGTATCAGCCCGAAGAAATCATCTTGGTTTCCTTCGTGCTTGTTTTCTTT 
TGCAGCAGTATCTAG TCCAGGAGCTACCCCACCAAACTGAAGAGTAAACTGAGAATGCTGAGTGCCAGGC 
ATTCACCATGCTGTTTTGATGTCTGT TTT TGATAGTTGCACACTGGGGCTGCCACGGATAAGCCCATGGC 
ATACACTGGGCTGGCTCTCCCTCCTCTATCCCTCTCCCAGGTGTGGGAAGGTCACTTTCACTATGCTTGT 
GAACTAAATGCTGGCTAACAAGTGTCAAAAAAAAAAAAAAJAAAAAAAAAAAAAAAAAAAA 



The M0L7 protein encoded by SEQ ID NO: 1 5 has 395 amino acid residues, and is 
presented using the one-letter code in Table 7B (SEQ ID NO: 16). The SignalP, Psort and/or 
Hydropathy profile for MOL7 predict that MOL7 has a signal peptide and is likely to be 
localized at the plasma membrane with a certainty of 0. 9190. The SignalP shows a signal 
10 sequence is coded for in the first 44 amino acids with the most likely cleavage site being between 
amino acids 30 and 3 1 . This is typical of this type of membrane protein. The molecular weight 
of MOL7 is 43S15.7 Daltons. 



Table 7B. Encoded MOL7 protein sequence (SEQ ID NO:16). 

MVSAAGLSGIX3KMRGVLLVLLGLLYSSTSCGVQ 

EFWLSIASAIQNRKDIWIVGISNMDPSKIAHTEYPVNT^ 

CFI/3RhtfjHTPPVLQNCWSGWNPTSATC 

SPMCttliQQFDLWVLRGILNFGGETCPGLFLYTKVTO 

TQKTYSDSELGHVGSYIjQGQRRTITHSRIiGNSSE^ 

GQNRLYQEEE 1 1 LVS PVLVFFCS S I 
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M0L7 was found to have 290 of 290 amino acid residues (100 %) identical to the 290 
amino acid residue hypothetical 32.6 kD protein from Homo sapiens (human) 
(ACC:CAB70765). This protein has similarity to kallikrein. 

MOL7 shows significant homologies to human hypothetical 32.6 kD protein (a protein 
5 with similiarities to Kallikrein), as described in, but not limited to, the references below. 

Kallikreins are a subgroup of serine proteases and these proteolytic enzymes have diverse 
physiological functions in many tissues. Growing evidence suggests that many kallikreins are 
implicated in carcinogenesis. The human kallikrein gene family is localized on chromosome 
19ql3.3-ql3.4 and currently includes three members: KLK1 or pancreatic/renal kallikrein, 
10 KLK2 or human glandular kallikrein and KLK3 or prostate-specific antigen (PSA). The latter 

two genes are almost prostate-specific and they arc used for diagnosis and monitoring of prostate 
cancer and more recently, in breast cancer applications (Y ousef et al, Anticancer Res 1999 Jul- 
Aug;19(4B):2843-52). These new genes, like the already known kallikreins, may have utility for 
diagnosis, monitoring and therapeutics of various cancers including those of the breast, prostate 
1 5 and testis. 

Monsees et al, Immunopharmacology 1999 Dec;45(l-3):107-14, found that elements of 
the kallikrein-kinin system are present in rat seminiferous epithelium. Peptide hormones are 
involved in the paracrine regulation of several physiological processes. The paracrine peptide 
system may play a role in the regulation of Sertoli cell function or in the Sertoli cell-germ cell 

20 crosstalk, and therefore, be involved in mammalian reproduction, especially spermatogenesis. 

Chen et al, J Biol Chem 1996 Nov l;271(44):27590-4, found that the kallikrein-kinin 
system participates in blood pressure regulation. One of the kallikrein-kinin system components, 
kallikrein-binding protein, binds to tissue kallikrein and inhibits its activity in vitro. 

The glandular kallikreins are a distinct group of serine proteases with a molecular weight 

25 of 25,000-40,000 and an ability to release vasoactive peptides from kininogen in vitro, although 
the kininogenase activity of different kallikreins is highly variable. The true physiologic role of 
specific kallikreins is often unrelated to the kininogenase activity. In the mouse a major site of 
kallikrein synthesis is the male submaxillary gland. Glandular kallikreins are also synthesized in 
the pancreas and kidney. The several kallikreins found in this tissue include epidermal growth 

30 factor binding protein (EGF-BP) and the gamma subunit of nerve growth factor (NGFG; 
162040) which are responsible for the processing of EGF (131530) and NGF (162030), 
respectively. Although EGF-BP and NGFG exhibit strict substrate specificity, they share 
extensive amino acid sequence homology and immunologic exossreactivity. Mason etal (1983) 
concluded that the glandular kallikrein gene family comprises 25-30 highly homologous genes 
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that encode specific proteases involved in the processing of biologically active peptides. All are 
closely linked on mouse chromosome 7 (assignment by Chinese hamster-mouse hybrid cell 
studies). Several human kallikrein genes have been isolated. 

Schedlich et al (1987) described a human glandular preprokaUikrein gene, hGK-1, 
5 isolated from a human genomic library. The 5.2-kb gene encodes a prepropeptide of 261 amino 
acids. The mature protein is 237 amino acids long and has 66% homology with the sequence 
predicted for the human kallikrein synthesized in pancreas, kidney, and salivary gland (KLK1; 
147910). Seventy-three percent homology with human prostate-specific antigen (APS; 176820) 
was observed. Expression of the glandular kallikrein gene, like that of the APS gene, seems to be 

10 restricted to the prostate. Riegman et al (1989) found that the glandular kallikrein gene and that 
for prostate-specific antigen are aligned in a head-to-tail orientation and are separated by about 
12 kb. Southern blot analysis of DNA from a panel of human-hamster hybrid cells showed that 
the genes are situated on chromosome 19. Since the KLK1 gene is also on chromosome 19, these 
3 genes probably represent a cluster. From in situ hybridization studies, Qin et al (1991) 

15 concluded that the glandular kallikrein gene and probably other kallikrein genes are located in 
ql3.3 and ql3.4 bands of chromosorne 19 and are probably near the border of these two bands. 

Therapeutic applications 

20 The expression pattern, and protein similarity information for MOL7 suggest that it may 

function as human Kallikrein-like protein. Therefore, the nucleic acid and protein of the 
invention are useful in potential therapeutic applications implicated, for example but not limited 
to, various cancers including those of the testis, prostate, and breast; mammalian reproduction, 
especially spermatogenesis; blood pressure regulation; and other diseases and disorders. The 

25 homology to antigenic secreted and membrane proteins suggests that antibodies directed against 
the novel genes may be useful in treatment and prevention of various cancers including those of 
the testis, prostate, and breast; mammalian reproduction, especially spermatogenesis; blood 
pressure regulation; and other diseases and disorders. 

Potential therapeutic uses for the invention(s) are, for example but not limited to, the 

30 following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody target 

(therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 
tissues and cell types derived from these tissues. 
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The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in various cancers including those of the testis, prostate, and breast; 
mammalian reproduction, especially spermatogenesis; blood pressure regulation;and other 
diseases and disorders. For example, but not limited to, a cDNA encoding the novel human 
5 plasma membrane protein may be useful in gene therapy, and the novel human plasma 

membrane protein may be useful when administered to a subject in need thereof. By way of 
non-limiting example, the compositions of the present invention will have efficacy for treatment 
of patients suffering from, for example, but not limited to, various cancers including those of the 
testis, prostate, and breast; mammalian reproduction, especially spermatogenesis; blood pressure 

10 regulation; and other diseases and disorders. The novel nucleic acid encoding the novel human 
plasma membrane protein, and the novel human plasma membrane protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. These materials are further useful in 
the generation of antibodies that bind irrununospecifically to the novel substances of the 

1 5 invention for use in therapeutic or diagnostic methods. 

MOL8 

MOL8a 

A novel human Acetyl LDL Receptor-like nucleic acid was identified by TblastN using 
20 CuraGen Corporation's sequence file for MOL probes or homologs and run against the Genomic 
Daily Files made available by GenBank. The nucleic acid was further predicted by the program 
GenScan™, including selection of exons. These were further modified by means of similarities 
using BLAST searches. The sequences were then manually corrected for apparent 
inconsistencies, thereby obtaining the sequences encoding the full-length protein. The disclosed 
25 novel MOL8a nucleic acid of 980 nucleotides (also referred to as 1 1800699-0-16) is shown in 
Table 8B. An open reading frame begins with an ATG initiation codon at nucleotides 1-3 and 
ends with a TGA codon at nucleotides 2803-2805. A putative untranslated region downstream 
from the termination codon are underlined in Table 8A, and the start and stop codons are in bold 
letters. 

30 

Table 8A. MOL8a Nucleotide Sequence (SEQ ID NO:17) 

atggagggcgcagggccccggggggccgggccggcgcggcgccggggagcx:ggggggccgccgtcaccgctgctgc 
cgtcgctgctgctgctgcrtgctgctctggatgctgccggacaccgtggcgcctcaggaactgaaccctcgcggccg 
caacgtgtgccgtgctccxxk3ctcccaggtgcccacgtgctgcgctggct 

attgcggt'gtgcgaaggc^ctccacgtgctcagagaacgaggtgtgcgtgaggcctggcgagtgccgctgccgcc 
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ACGSCTACTTCGGTGCCAACTGCGACACCA(m^^ 

GAGAGATGG^CGGGGTCCAAGG , rGGGGCGGGGTCGGAa , CCGCCrTCGGGtXX^GTCCCCAGAGGT3GCGGCTGGA 
GTGwGGGACGCGGGCAC^GlTCCGGCTGGCCGGGGGCACCTACTCAAGCACCGGGGCCTTCCACCCCCrrCCGC'i'CCT 
CCCCTGCAGAGTGCCCGCGCCAGTTCTGGGGCCCCGACTGCAAGG^^ 

CG AG GACGTG ACAGGCCAGTGT ACT TGTCACG C GCGGCGCT GGG GC GCGCGCTGCG AGCAT GCGTGCCAGTGCC AG 
C ACGGCACGTGC CACCCCCGGAGC GGCG CG TGC CGCTGTGAGTC CG G CT G3TGGGGCGCGCAGTGCGCCAGCGCGT 
GCTACTGCAGCGCCACGTt^CGCTGCGACCCACAGACCGGCGCCTGCCTGTGCCACGCAGGCTGGTGGGGCCGCAG 
CTGCAACAACCAGTGCGCCTGCAACTCGTCTCCCTGCGAGCAGCAGAGCGGCCGCTGTCAGTGCCGCGAGCGTACG 
TTCGGCGCGCGCTGCGATCGCTACTGCCAGTGCTTCCGCGGCCGCTGCCACCCTGTGGACGGCACGTGTGCCTGCG 
AGCCGGGCTACCGCGGCAAGTACTGTCGCGAGCCGTGCCCCGCCGGCrTCTACGGCTTGGGCTGTCGCCGCCGGTG 
TGGCCAGTGCAAGGGCCAGCAGCCGTGCACGGTGGCCGAGGGCCGCTGCTTGACGTGCGAGCCCGGCTGGAACGGA 
ACCAAGTGCGACCAGCCTTGCGCCACCGGTTTCTArGGCGAGGGCTGCAGCCACCGCTGTCCGCCATGCCGCGACG 
GGCATGCCTGTAACCATGTCACCGGCAAGTGTACGCGCTGCAACGCGGGCTGGATCGGCGACCGGTSCGAGACCAA 
GTGTAGCAATGGCACTTACGGCGAGGACTGCGCCTTCGTGTGCGCCGACrGCGGCAGCGGACACTGCGACTTCCAG 
TCGGGGCGCTGCCTGTGCAGCCCTGGCGTCGACGGGCXXCACTGTAACGTGAC^ 

CGGACTGTGCTCAGGCCTGCAGCTGCCACGAGGACACGTGCGACCCGGTCACTGGTGCCTGCCACCTAGAAACCAA 
CCAGCGCAAGGG CGTGATGGGCGCG GGCGCGCTGCT CGTCCTGCTCGTCT GCCTGCT GCT CTCGCTGCTCGGCT GC 
TGCrGCGCTTGCCGCGGCAAGGACCCTACGCGCCGGGAGCTTTCGCTTGGGAGGAAGAAGGCGCCGCACCGACTAT 
GCGGGCGCTTCAGTCGCATCAGCATGAAGCTGCCCCGGATCCCGCTCCGGAGGCAGAAACTACCCAAAGTCGTAGT 
GGCCCACCACGACCTGGATAACACACTCAACTGCAGCTTCCTGGAGCCACCCTC^GGGCTGGAGCAGCCCTCACCA 
TCCrGGTCCTCTCGGGCCTCCTTCTCCTCGTTTGACACCACTGATGAAGGCCCTGTGTACTGTGTACCCCATGAGG 
AGGCACCAGCGGAGAGCCGGGACCCCGAAGTCCCCACTGTCCCTGCCGAGGCGCCGGCGCCGTCCCCTGTGCCCTT 
GACCACGCCAGCCTCCGCCGAGGAGGCGATACCCCTCCCCGCGTCCTCCGACAGCGAGCGGTCGGCGTCCAGCGTG 
GAGGGGCCCGGAGGGGCTCTGTACGCGCGCGTGGCCCGACGCGAGGCCCGGCCGGCCCGGGCCCGG3GCGAGATTG 
GGGGCCTGTCGCTGTCGCCATCGCCCGAGCGCAGGAAACCGCCGCCACCrGACCCCGCCACCAAGCCTAAGGTGTC 
CTGGATCCACGGCAAGCACAGCGCCGCTGCAGCTGGCCGTGCGCCCTCACCACCGCCGCCAGGCTCCGAGGCCGCG 
CCCAGCCCCAGCAAGAGGAAACGGACGCCCAGCGACAAATCGGCGCATACGGTCGAACACGGCAGCCCCCGGACCC 
GCGACCCAACGCCGCGCCCCCCGGGGCTGCCCGAGGAGGCGACAGCCCTCGC^ 

AGCGOSCGCCGCGCCCCGGCCrCTTGGAGCCCACGGACGCCGGCGGTCCCCCGCGAAGCGCGCCGAGGCTGCCTCC 
ATGTTGGCCGCTGACGTGCGCGGCAAGACTCGCAGCCTGGGCCGCGCCGAGGTGGCCCTGGGCGCGCAGGGCCCCA 
GGGAAAAGCCGGCGCCCCCAC^GAAAGCCAAGCGCTCCGTGCCGCCAGCCTCGCCCGCCCGCGCGCCCCCAGCGAC 
CGAAACCCCGGGGCCTGAGAAGGCGGCGACCGACTTGCCCGCGCCTGAGACCCCCCGGAAGAAGACCCCCATCCAG 
AAGCCGCCGCGCAAGAAGAGCCGGAGGCGGCGGGCGAGCTGGGCAGGGCGGGCGCACCCACCCTGTAG 



The MOL8a protein encoded by SEQ ID NO: 16 has 324 amino acid residues, and is 
presented using the one-letter code in Table 8B (SEQ ID NO:l 8). The SignalP, Psort and/or 
Hydropathy profile for MOLSa predict that M0L8a has a signal peptide and is likely to be 
5 localized at the plasma membrane with a certainty of 0.6000. The SignalP shows a signal 
sequence with a cleavage site between amino acids 43 and 44. This is typical of this type of 
membrane protein. Therefore it is likely that this novel human plasma membrane protein is 
available at the appropriate sub-cellular localization and hence accessible for the therapeutic uses 
described in this application. 

10 



Table 8B. Encoded MOL8a protein sequence (SEQ ID NO:18). 



ME GAG PRG AG P ARRRGAGG P PS PLL P S LLLLLLLWML P DT VAPQELN PRGRNVCRAPGSQV PTCC AGWRQQG DE CG 

IAVCEGNSTCSENEVCVRPGECRCRHGYFGAKCDTSERGVGPVLVGGAESWRDGAGSKVGRGRIRLRGGSPEVAAG 

VRDAGRFRIAGGTYSSTGAFHPLRSSPAECPRQFWGPDCKELCSCHPHGO^EDVTGO^TCHARRWGARCEHACQCQ 

HGTCHPRSGACRCESGWWGAO^^ACYCSATSRCDPQTGACIj^ 

FGARCDRYCC£FRGRCHPVTX3T<^CEPGYRGKYCREPCPAGEYGIX^R 

TKCDQPCATGFYGEGCSHRCPPCRDGIIACNHVTGKCTRCNAGWIGDRCETKCSNGTYGEIXIAEVCADCGSGHCDFQ 
SGRCLCSPGVHGPHQJVTCPPGLHGAIX^QACSCIIEDTC 

CCACRGKDPTRRELSLGRKKAPHRLCGRFSRISMKLPRI PLRRQKLPKWVAHHDLDNT1WCSFLEPPSGLEQPSP 
SWSSRASFSSFDTTDEGPVYCVPHEEAPAESRDPEVPTVPAEAPAPSPVPLTTPASAEEAIPLPASSDSERSASSV 
EGPGGALYARVARREARPARARGEIGGLSLSPSPERRKPPPPDPATKPKVSWIHGKHSAAAAGRAPSPPPPGSEAA 
PSPSKRKRTPSDKSAHT\TEHGSPRTRDPTPRPrcLPEEATAIJWPSPPPJtflAFJ^ 

MLAADVRGKTRSLGRAEVALGAQGPREKPAPPQKAKRS VP PAS PARAP PATET PGPEKAAT DLPAPET PRKKTPIQ 
KPPRKKS REAAGELGRAGAPTL 
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The full amino acid sequence of the protein of the invention was found to have 576/729 
(79%) identical and 596/729 (81%) similarity to a murine nurse cell receptor amino acid 
sequence (PatP Accession No. Y85616). The full amino acid sequence of the protein of the 
5 invention was also found to have 296 of 741 amino acid residues (39 %) identical and 383 of 741 
amino acid residues (51 %) homolog to the 830 amino acid residue acetyl LDL receptor 
precursor from Homo sapiens (human) (ACC:O43701). 

MOL8a is expressed in the following tissues: fetal thymus, mammary gland, fetal 
thymus, pool of ten tissues (adrenal, mammary, prostate, testis, uterus, bone marrow*, 
10 melanoma*, pituitary*, thyroid*, spleen) (*from mRNA rather than from total RNA). 



MOL8b 

A novel nucleic acid was identified by laboratory cloning of cDNA fragments, by in 
silico prediction of the sequence. cDNA fragments covering either the full length of the DNA 

1 5 sequence, or part of the sequence, or both, were cloned. In silico prediction was based on 
sequences available in Curagen's proprietary sequence databases or in the public human 
sequence databases, and provided either the full length DNA sequence, or some portion thereof. 
These were further modified by means of similarities using BLAST searches. The sequences 
were then manually corrected for apparent inconsistencies, thereby obtaining the sequences 

20 encoding the full-length protein. The disclosed novel MOL8b nucleic acid of 2598 nucleotides 
(also referred to as CG50S89-02) is shown in Table 8C. An open reading frame begins with an 
ATG initiation codon at nucleotides 1-3 and ends with a TAG codon at nucleotides 2596-2598. 
A putative untranslated region upstream from the initiation codon and downstream from the 
termination codon are underlined in Tabic 8D, and the start and stop codons are in bold letters. 

25 



Table 8D. MOL8b Nucleotide Sequence (SEQ ID NO:19) 

ATGGAGGGCGCAGGGCCCCGGGGGGCCGGGCCGGCGCGGCGCCGGGGAGCCGGGGGGCCGCCGTCACCGCTGCTGC 

CGTCGCTGCTGCTGCTGCTGCTGCTCTGGATGCTGCCGGACACCGTGGCGCCTCAGG7ACTGAACCCTCGCGGCCG 

O^CGTGTGCCGTGCTCCCXGCTCCCAGGTGCCCACGTGCTGCGCTGGCTGGAGGCAGCAAGGGGACGAGTGTGGG 

ATTGCGGTGTGCGAAGGCAACTCCACGTGCTCAGAGAACGAGGTGTGCGTGAGGCCTGGCa^GTGCCGCTGCCGCC 

ACGGCTACTTCGGTGCCAACTGCGACACCAAGTGCCCGCGCCAGTTCTGGGGCCCCGACTGCAAGGAGCTGTGTAG 

CTGCCACCCACACGGGCAGTGCGAGGACGTGACAGGCCAGTGTACTTGTCACGCGCGGCGCTGGGGCGCGCGCTGC 

GAGCATGCGTGCCAGTGCCAGCACGGCACGTGCCACCCGCGGAGCGGCGCGTGCCGCTGTGAGCCCGGCTGGTGGG 

GCGCGCAGTGCGCCAGCGCGTGCTACTGCAGCGCCACGTCGCGCTGCGACCCACAGACCGGCGCCTGCCTGTGCCA 

CGCAGGCTGGTGGGGCCGCAGCTGC^CAACCAGTGCGCCTGCAACTCGTCTCCCTGCGAGCAGCAGAGCGGCCGC 

TGTCAGTGCCGCGAGCGTACGTTCGGCGCGCGCTGCGATCGCTACTGCCAGTGCTTCCGCGGCCGCTGCCACCCTG 

TGGACGGCACGTGTGCCTGCGAGCCGGGCTACCGCGGCAAGTACTGTCGCGAGCCGTGCCCCGCCGGCTTCTACGG 

CTTGGGCTGTCGCCGCQGGTGTGGCCAGTGCAAGGGCCAGCAGCCGrGCACGGTGGCCGAGGGCX^GCTGCTTGACG 

TGCGAGCCCGGCTGGAACGGAACCAAGTGCGACCAGCCTTGCGCCACCGGTTTCTATGGCGAGGGCTGCAGCCACC 

GCTGTCCGCCATGCCGCGACGGGCATGCCTGTAACCATGTCACCGGCAAGTGTACGCGCTGCAACGCGGOT 

CGGCGACCGGTGCGAGACTAAGTGTAGCAATGGCACTTACGGCGAGGACTGCGCCTTCGTGTGCGCCGACTGCGGC 
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AGCGGACACTGCGACTTCC^STCGGGGCGCTGCCTGTGCAGCCCTGGCGTCCACGGGCCCCACTGTAACGTGACGT 
GCCCGCCCGGACTCCACGGCGCGGACTGTGCTCAGGCCTGCAGCTGCCACGAGGACACGTGCGACCCGGTCACTGG 
TGCCTGCCACCTAGAAACCAACCAGCGCAAGGGCGTGATGGGCGCGGGCGCX3CTGCTCGTGCTGCTCGTCTGCCTG 
CTGCTCTCXSCTGCTCGGCTGCTGCTGCGCTTGCCGCGGCAAGGACCCTACGCGCCGGGAGCTTTCGCTTGGGAGGA 
AGAAGGCGCCGCACCGACTArGCGGGCGCTTCAGTCGCATCAGCATGAAGCTGCCCCGGATCCCGCTCCGGAGGCA 
GAAACTACCCAAAGTCGTAGT^CCCACCACGACCTGGATAACACACTCAACTGCAGCTTCCTGGAGCCACCCTCA 
GGGCTGGAGCAGCCCTCACCATCCTGGTCCTCTCGGGCCTCCTTCTCCTCGTTTGACACCACTGATGAAGGCCCTG 
TGTACTGTGTACCCCATGAGGAGGCACCAGCGGAGAGCCGGGACCCCGAAGTCCCCACTGTCCCTGCCGAGGCGCC 
GGCGCCGTCCCCTGTGCCCT'PGACCACGCCAGCCTCCGCCGAGGAGGCGATACCCCTCCCCGCGTCCTCCGACAGC 
GAGCGGTCGGCGTCCAGCGTGGAGGGGCCCGGAGGGGCTCTGTACGCGCGCGTGGCCCGACGCGAGGCCCGGCCGG 
CCCGGGCCCGGGGCGAGATTGGGGGCCTG?CGCTGTCGCCATCGCCCGAGCGCAGGAAACCGCCGCCACCTGACCC 
CGCCACCAAGCCTAAGGTGTCCTGGATCCACGGCAAGCACAGCGCCGCTGCAGCTGGCCGTGCGCCCTCACCACCG 
CCGCCAGGCTCCGAGGCCGCGCCCAGCCCCAGCAAGAGG7\AACGGACGCCCAGCGACAAATCGGCGCATACGGTCG 
AACACGGCAGCCCCCGGACCCGCGACCCAACGCCGCGCCCCCCGGGGCTGCCCGAGGAGGCGACAGCCCTCGCTGC 
GCCCTCGCCGCCCAGGGCCCGAGCGCGCGCCGCGCCCCGGCCTCTTGGAGCCCACGGACGCCGGCGGTCCCCCGCG 
AAGCGCGCCGAGGCTGCCTCCATGTTGGCCGCTGACGTGCGCGGCAAGACTCGCAGCCTGGGCCGCGCCGAGGTGG 
CCCTGGGCGCGCAGGGCCCCAGGGAAAAGCCGGCGCCCCCACAGAAAGCCAAGCGCTCCGTGCCGCCAGCCTCGCC 
CGCCCGCGCGCCCCCAGCGACCGAAACCCCGGGGCCTGAGAAGGCGGCGACCGACTTGCCCGCGCCTGAGACCCCC 
CGGAAGAAGACCCCCATCCAGAAGCCGCCGCGCAAGAAGAGCCGGGAGGCGGCGGGCGAGCTGGGCAGGGCGGGCG 
CACCCACCCTGTAG 



The disclosed nucleic acid sequence has 1311 of 2041 bases (64%) identical to a 
gb:GENBANK-ID:D86864|acc:D86864.1 mRNA from Homo sapiens (Homo sapiens mRNA for 
acetyl LDL receptor, complete cds) (E value = 3.2e- 98 ). 

5 The MOL8b protein encoded by SEQ ID NO: 1 7 has 865 amino acid residues, and is 

presented using the one-letter code in Table 8E (SEQ ID NO:20). The SignalP, Psort and/or 
Hydropathy profile for MOLSa predict that MOLSa has a signal peptide and is likely to be 
localized at the plasma membrane with a certainty of 0.6000. The SignalP shows a signal 
sequence with a cleavage site between amino acids 43 and 44. This is typical of this type of 

10 membrane protein. Therefore it is likely that this novel human plasma membrane protein is 

available at the appropriate sub-cellular localization and hence accessible for the therapeutic uses 
described in this application. 



Table 8E. Encoded MOI.8L protein sequence (SEQ HD NO:20). 

MEGAGPRGAGPARRRGAGGPPSPLLPSLLLLLLLWMIiPDTVAPQELNPRGRNVCRAPGSQVPTCCAGWRQQGDECG 
IAVCEGNSTCSENEVCVRPGECRCRHGYFGANCDTKCPRQFWGPDCKELCSCHPHGQCEDVTGQCTCHARRWGARC 
EHACQCQHGTCHPRSGACRCE PGWWGAQC ASACYCS ATSRCDPQTGACLCHAGWWGRS CNNQCACNSS PCEQQSGR 
CQCRERTFGARCDRY CQC FRGRCHPVDGTCACEPGYRGKYCRE PC PAGFYGLGCRRRCGQCKGQQPCrVAEGRCI/T 
CE PGWNGTKCDQPCATG FYGEGCSHRC PPCRDGKACNH VTGKCTRCNAGWIGDRCETKCSNGTYGE DCAFVCADCG 
SGHCDFQSGRCLCSPGVHGPHCNVTCPPGLHGADCAQACSCHEDTCDP\^ 

LLSLLGCCCACRGKDPTRRELSI^RKKAPHRLCGRFSRISMKLPRIPLRRQKLPKVWAHHDLDNTLWCSFLEPPS 
GLEQPS PS WSSRASFSS FDTT DEG PVYCVPHEEAPAESRDPEVPTVPAEAPAPS PVPLTTPASAEEAI PLPASS DS 
ERS ASSVEGPGGALYARVARREARPARARGEIGGLSLS PS PERRKPPPPDPATKPKVSWIHGKHSAAAAGRAPS PP 
PPGSEAAPS PSKRKRTPS DKS AHTVEHGS PRTRDPT PRPPGLPEEATALAAPS PPRARARAAPRPLGAHGRRRS PA 
KRAEAASMLAADVRGKTRSLGRAEVALGAQGPREKPAPPQKAKRSVPPASPARAPPATETPGPEKAATDLPAPETP 
RKKT P I QKPPRKKSRE AAGELGRAGAPT L 



The full amino acid sequence of the protein of the invention was found to have 340 of 
1 5 823 amino acid residues (41 %) identical to, and 443 of 823 amino acid residues (53%) similar to, 
the 830 amino acid residue ptnr:SPTREMBL-ACC:O4370 1 protein from Homo sapiens 
(Human) (ACETYL LDL RECEPTOR PRECURSOR) (E value = 9.0e- 164 ). 
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Homology between MOL8a t 8b and the human acetyl LDL receptor are presented 
graphically in the multiple sequence alignment given in Table 8F (with MOLSa being shown on 
line 1, and MOL8b being shown on line 2) as a ClustalW analysis comparing MOL8 with related 
protein sequences. 

5 

Table 8F. Information for the ClustalW proteins: 

1) MOL8a (SEQ ID NO: 18) 

2) M0L8b (SEO ID NO: 20) 

3) gil 4507203|ref |NP_003684 .1 1 acetyl LDL receptor; SREC*scavenger receptor 
10 expressed by endothelial cells [Homo sapiens] (SEQ ID NO:57> 
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15 



KOLBa Prot 
MOLiSb Prot 
NP 003684 




KOLBa Prot 
HDL 6b Prot 
NP 003684 



MOLBa Prot 
MOLBb Prot 
NP 003684 



KOL8a Prot 
MOLBb Prot 
MP 003684 



MOLBa Prot 
MOLBb Prot 
NP 003684 



aadvrgktr^HslgraevaIlgaqgprekp 



870 
I 

! 



JlPKP PRQALNRKKG SPGQaSGSQg QSPNSAPKAG 
930 

....I.... | I I ....I 



LgGATGPMAQ RgSEAVgGLG 



884 
615 
781 



920 

..I . 

55 



940 

.1 . 



950 



KAATDLPAPE|TPRKKTPIQ 
KAATDLPAPElrPRKKTPIQ 



agJJJssrraq efvsgcgsE qdpqEqaeee 



934 
665 
630 



Chromosomal information: 

MOL8 maps to chromosome 22ql 1 . This assignment was made using mapping 
information associated with genomic clones, public genes and ESTs sharing sequence identity 
with the disclosed sequence and CuraGen Corporation's Electronic Northern bioinformatic tool. 

Tissue expression 

MOL8 is expressed in at least the following tissues: kidney, senescent fibroblasts, 
lymphocyte, B cell, and germ cell tumors. Expression information was derived from the tissue 
sources of the sequences that were included in the derivation of the sequence of CuraGen Acc. 
No. CG50889-02. 

MOL8 shows significant homologies to human LDL Receptor-like protein, as described 
in, but not limited to, the references below. Hypercholesterolemia is an autosomal dominant 
disorder characterized by elevation of serum cholesterol bound to low density lipoprotein (LDL). 
Mutations in the LDL receptor (LDLR) gene on chromosome 1 9 cause this disorder. Familial 
hypercholesterolemia is characterized by elevation of serum cholesterol bound to low density 
lipoprotein (LDL) and is, hence, one of the conditions producing the hyperlipoproteinemia II 
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phenotype (see OMIM 144400). Heterozygotes develop tendinous xanthomas, corneal arcus, and 
coronary artery disease; the last usually becomes evident in the fourth or fifth decade. 
Homozygotes develop these features at a accelerated rate in addition to planar xanthomas, which 
may be evident at birth in the web between the first two digits. 

5 Hepatitis C virus (HCV), the principal viral cause of chronic hepatitis, is not readily 

replicated in cell culture systems, making it difficult to ascertain information on cell receptors for 
the virus. However, several observations from studies on the role of HCV in mixed 
cryoglobulinemia provided some insight into HCV entry into cells. Evidence indicated that HCV 
and other viruses enter cells through the mediation of LDL receptors: by the demonstration that 

10 endocytosis of these viruses correlates with LDL receptor activity, by complete inhibition of 
detectable endocytosis by anti-LDL receptor antibody, by inhibition with anti-apolipoprotein E 
and anti-apolipoprotein B antibodies, by chemical methods abrogating lipoprotcin/LDL receptor 
interactions, and by inhibition with the endocytosis inhibitor phenylarsine oxide. Agnello et al. 
(1999) provided confirmatory evidence by the lack of detectable LDL receptor on cells known to 

15 be resistant to infection by one of these viruses, bovine viral diarrheal virus (BVDV). 

Endocytosis via the LDL receptor was shown to be mediated by complexing of the virus to very 
low density lipoprotein (VLDL) or LDL, but not high density lipoprotein (HDL). Studies using 
LDL receptor-deficient cells or a cytolytic BVDV system indicated that the LDL receptor may 
be the main but not exclusive means of cell entry of these viruses. 

20 Therapeutic uses of the composition 

The expression pattern, and protein similarity information for MOL8 that it may function 
as human LDL Receptor-like protein. Therefore, the nucleic acid and protein of the invention are 
useful in potential therapeutic applications implicated, for example but not limited to, metabolic 
disorders, e.g. Hypercholesterolemia, viral diseases, and other diseases and disorders. The 

25 homology to antigenic secreted and membrane proteins suggests that antibodies directed against 
the novel genes may be useful in treatment and prevention of metabolic disorders, e.g. 
Hypercholesterolemia, viral diseases, and other diseases and disorders. 

Potential therapeutic uses for the invention(s) are, for example but not limited to, the 
following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody target 

30 (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or prognostic 
marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) tissue 
regeneration in vitro and in vivo (regeneration for all these tissues and cell types composing these 
tissues and cell types derived from these tissues. 
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The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in various cancers including those metabolic disorders, e.g. 
Hypercholesterolemia, viral diseases, and other diseases and disorders. For example, but not 
limited to, a cDNA encoding the novel human plasma membrane protein may be useful in gene 
5 therapy, and the novel human plasma membrane protein may be useful when administered to a 
subject in need thereof. By way of non-limiting example, the compositions of the present 
invention will have efficacy for treatment of patients suffering from, for example, but not limited 
to, various cancers including those of the metabolic disorders, e.g. Hypercholesterolemia, viral 
diseases, and other diseases and disorders. The novel nucleic acid encoding the novel human 

10 plasma membrane protein, and the novel human plasma membrane protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. These materials are further useful in 
the generation of antibodies that bind immunospecifically to the novel substances of the 
invention for use in therapeutic or diagnostic methods. 

1 5 These materials aTe further useful in the generation of antibodies that bind immuno- 

specifically to the novel MOL8 substances for use in therapeutic or diagnostic methods. These 
antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 
example the disclosed MOL8 protein has multiple hydrophilic regions, each of which can be 

20 used as an immunogen. In one embodiment, a contemplated MOL8 epitope is from about amino 
acids 1 to 10. In another embodiment, a MOL8 epitope is from about amino acids 50 to 200. In 
further embodiment, a MOL8 epitope contains amino acids 210-400, 475-600, or 625-850. These 
novel proteins can also be used to develop assay system for functional analysis. 

25 MOL9 

MOL9a 

A novel nucleic acid encoding a neurolysin -like protein was identified by TblastN using 
CuraGen Corporation's sequence file for MOL9 probe or homolog, run against the Genomic 
Daily Files made available by GenBank. The nucleic acid was further predicted by the program 
30 GenScan™, including selection of exons. These were further modified by means of similarities 
using BLAST searches. The sequences were then manually corrected for apparent 
inconsistencies, thereby obtaining the sequences encoding the full-length protein. The disclosed 
novel MOL9a nucleic acid of 2355 nucleotides (also referred to as 1 95067 19_B_EXT) is shown 
in Table 9 A. An open reading frame begins with an ATG initiation codon at nucleotides 1-3 and 
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ends with a TG A codon at nucleotides 1915-1917. A putative untranslated region upstream from 
the initiation codon and downstream from the termination codon are underlined in Table 9 A, and 
the start and stop codons are in bold letters. 

The nucleic acid sequence has 1307 of 1428 bases (91%) identical to a pig neurolysin 
5 mRNA (GENB ANK-ID: AB000 1 70) (Expect = 0.0). 



Table 9A. MOL9a Nucleotide Sequence (SEQ ID NO:21) 

ATCTTGACTTTGGACCAACAGAAATCCCTAATTCTTATTCTTTTTCTGATTCTTTTTAGAGTTGGTGGTTC 
CAGGATTTTACTCAGAATGACGTTAGGAAGAGAAGTGATGTCTCCTCTTCAGGCAATGTCTTCCTATACTG 
TGGCTGGCAGAAATGTTTTAAGATGGGATCTTTCACCAGAGCAAATTAAAACAAGAACTGAGGAGCTCATT 
GTGCAGACCAAACAGGTGTACGATGCTGTTGGAATGCTCGGTATTGAGGAAGTAACTTACGAGAACTGTCT 
GCAGGCACTGGCAGTGGAAAGGACCATGCTAGACTTTCCCCAGCATGTATCCTCTGACAAAGAAGTACGAG 
C AGCAAGT AC AGAAGC AG ACAAAAGACTTTCT CG TTTTGAT AT T G AGATG AGCATGAG AGG AGAT ATAT TT 
GAGAGAATTGTTCATTTACAGCAGGJ^CCTGTGATCTGGGGAAGATAAAACCTGAGGCCAGACGATACTT 
GGAAA^GTCAATTAAAATGGGGAAAAGAAATGGGCTCCATCTTCCTGiVACAAGTACAGAATGAAATCAAAT 
C AATG AAGAAAAG AAT GAGTG AGCTATGT ATTGATTTT AACAAAAACCTC AATGAGGAT GAT ACCTTCCT T 
GTATTTTCCAJVGGCTGAACTTGGTGCTCTTCCTGATGATTTCATTGACAGTTTAGAAAAGACAGATGATGA 
CAAGT AT AAAATT ACC TT AAAATATC CAC ACTAT TT CCCTGT CATGAAGAAATGTT GT AT C CCTGAAACC A 
GAAGAAGGATGGAAAT GGCTTTTAAT ACAAGGTGCAAAGAGG AAAACACC AT AATTTT GCAGCAGCTACTC 
CCACT3CGAACCAAGGTGGCCAAACTACTCGGTTATAGCACACATGCTGACTTCGTCCTTGAAATGAACAC 
TG CAAAG AGCACAAGCCG CGT AAC AGCCTTTCTAG ATG ATTT AAGCCAGAAGTTAAAACCCTTGG GTGAAG 
C^GAACGAGAGTTTATTTTGAATTTGAAGAAAAAGGAATGCAAAGACAGGGGTTTTGAATATGATGGGAAA 
ATCAATGCCTGGGATCTATATTACTACATGACTC^GACAGAGGAACTCAAGTATTCCATAGACCAAGAGTT 
CCTCAAGGAATACTTCCCAATTGAGGTGGTCACTGAAGGCTTGCTGAACACCTACCAGGAGTTGTTGGGAC 
TTTC AT TTG AACAAATG AC AG ATGCTC AT GTTTGG AACAAG AGTG TTAC ACTTT AT ACTGT GAAG GAT AAA 
GCT AC AGG AGAAG7ATT G GGACAGTT CT ATTTGG ACCTCTAT CC AAG GCCTAGGGAAGG AAAAT AC AAT C A 
TGCGGCCTGCTTCGGTCTCCAGCCTGGCTGCCTTCTGCCTGATGGAAGCCGGATGATGGCAGTGGCTGCCC 
TCGTGGTGAACTTCTCACAGCCAGTGGCAGGTCGTCCCTCTCTCCTGAGACACGACGAGGTGAGGACTTAC 
TT T C ATG AG T TTG GT C AC GTGAT G CAT CAG AT TT GT GCACAG ACTGAT TTTGCACG AT T TAGCG 3AACAAA 
TGTGGAAACTGACTTTGTAGAGGTGCCATCGCAAATGCTTGAAAATTGGGTGTGGGACGTCGATTCCCTCC 
GAAGATTGTCAAAACATTATAAAGATGGAAGCCCTATTGCAGACGATCTGCTTGAAAAAC7TGTTGCTTCT 
AG GCTGGTCAACACAGGT ATGGGTT ATGT T ATTAGTAATAT AT AT TT T CT AGATATGTTTTCATT TCAGTG 
CATCCAAGTTGGAATGAAATACAGAAACCTAATCCTGAAACCTGGGGGATCTCTGGACGGCATGGACATGC 
TCCACAATTTCTTGAAACGTGAGCCAAACCAAAAAGCGTTCCTAATGAGTAGAGGCCTGCATGCTCCGTGA 
ACTGGGGATCTTTGGTAGCCGTCCATGTCTGGAGGACAAGTCGACATCACCATGTGTTACTGGCCTGGAAA 
CTGAAGGGAGTTTTGCAAGTGAAAATTTAGATTTCTATTGACATCCTTTTGTTTTCTAATTTTAAAAATTA 
T AAAG ATGT AAAT GGAAT TATAAAT ACTGTG ACC T AAGAAAAGACCC ACT AGAAAGT AATT GT ACT ATAAA 
ATTT C ATAAAACTGG ATT TG ATTTCTTTT TAT G AAAGTTTCATAT G AATGTAACTTG ATTT TTT ACTATTA 
TAATCTAGATAATATGATArAAGAGGGCTAAGAATTTTTAAATTGAATCATATATATGATATAArTTGATC 
CTTCTTGTATCTTGAAGTTTTGTACTTGGGATTTCTGGACTGATAAATGAATCATCACATTCTTCTGGTAA 
ATATTTTCTTGG . 



The MOL9a protein encoded by SEQ ID NO:21 and has 638 amino acid residues, and is 
presented using the one-letter code in Table 9B (SEQ ID NO:22). The SignalP, Psort and or 
1 0 Hydropathy profile indicate that this sequence has a signal peptide between positions 23 and 24 
and is likely to be localized at endoplasmic reticulum and plasma membrane (Certainty = 
0.8200). 



Table 9B. Encoded MOL9a protein sequence (SEQ ID NO:22) 

riLTLI^KSLILILFLILFRVGGSRILLRWTLGREVMSPI^AMSSYTVAGRNVI^WDLSPEQrKTRTE^LI 
VQTKQVYDAVGMLGIEEVTYENCLC^LAVERTM^ 

ERI VHLQQETCDLGKI KPEARRYLEKS I KMGKRKGLHLPEQVQNEIKSMKKRMSELCI DFNKNLNEDDTFL 
VFSKAELGALPDDFIDSLEKTDDDKYKITLKYPHYFPVMKKCCIPETRRRME^ 

PLRT KVAKLLG YS T HADFVLEMNT AK ST S RVT AFLDDLSQKLKPLGEAE RE FI LNLKKKEC KDRG FE YDGK 
INAWDLYYVMTQTEELKYSIDQEFLKEYFPIEWTEGLimTQELLGLSFECKrDAHVWNKSVTLYTVKDK 
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ATGEVLGQFYLDLYPRPREGKYNHAACFGLQPGCLLPDGSRMMAVAALVVNFSQPVAGRPSLLRHUtVHTy 
FHEFGHVMHQICAQTDFARFSGTNVETDFVEVPSQMLENWVWDVDSLRRLSKHYKDGSPIADD3.1.EKLVAS 
RLVNTGMGyVlSNIYFLDMFSFQCTQVGMKYRN],I].K?GG5T.DGKDMLHNFLKRF.PNQKAFI.MSRGLHAP 

The ftill amino acid sequence of the protein of the invention was found to have 483/564 
(85%) identity and 508/564 (90%) similarity to a rabbit cndopcptidase (PatP Accession No. 
R261 14), and have 571 of 632 amino acid residues (90%) identical to, and 592 of 632 residues 
5 (93.6%) similar to, the 704 amino acid residue neurolysin protein from pig (ptnr:SPTREMBL- 
ACC: Q02038) (E value = l.le- 302 ). 



MOL9a also has homology to the proteins shown in the BLAST alignments in Table 9C. 



Table 9C. BLASTX results for MOL9a 






Smallest 








Sum 






High 


Prob 




Sequences producing High-scoring Segment Pairs: 


Score 


P(N) 


N 


ptnr:SWISSNEW-ACC:Q0203B NEUROLYSIN PRECURSOR (EC 3.4.24.. 


. 2665 


l.le-302 


2 


ptnr:SWISSPR0T-ACC:P42675 NEUROLYSIN PRECURSOR {EC 3.4.24. 


. 2697 


2.2e-302 


2 


ptnr:SPTREKBL-ACC:P79433 ENDOPEPTIDASE 24.16 (EC 3.4.-.-). 


. 2677 


7 .6e-302 


2 


ptnr:SWISSPROT-ACC:P4267 6 NEUROLYSIN PRECURSOR (EC 3.4.24. 


. 2601 


4.7e-291 


2 


ptnr:SWISSPROT-ACC:P4778 8 THIMET OLIGOPEPTIDASE (EC 3.4.2. 


. 1792 


6.4e-197 


2 



Chromosomal information 

1 0 MOL9a maps to the Unigene entry Hs. 22 1 5 1 which maps to chromosome 5 between 

markers D5S427-D5S647 (69.6-74.7 cM). 

Tissue expression 

MOL9a is expressed in at least the following tissues: fetal lung, testis, B-cell, aorta, 
1 5 brain, colon, foreskin, germ cell, heart, kidney, pancreas, stomach, uterus, whole embryo and 
cancer cell lines MDA-MB-231 and MCF-7. 

These materials are further useful in the generation of antibodies that bind immuno- 
specifically to the novel MOL9a substances for use in therapeutic or diagnostic methods. These 

20 antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 
example the disclosed MOL9a protein has multiple hydrophilic regions, each of which can be 
used as an immunogen. In one embodiment, a contemplated MOL9a epitope is from about 
amino acids 50 to 75. In another embodiment, a MOL9a epitope is from about amino acids 100 

25 to 150. In further embodiments, MOL9a epitopes are found in amino acids 175-200, 225-300, 
325-375, 425-450, 500-550, and 600-625. These, novel proteins can also be used to develop 
assay system for functional analysis. 
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MOL9b 

The cloned open reading frame, codes for a 687 amino acid long protein with an overall 
95% amino acid identity, to the mature form of the pig neurolysin precursor (SWISSPROT- 
5 ACC:Q02038).Oligonucleotide primers were designed to PCR amplify a DNA segment, 
representing an ORF, coding for the mature form of 1 95067 19_B EXT.The forward primer 
includes an, in frame, BamHI restriction site. The reverse primer contains an, in frame, Xhol 
restriction site. The sequences of the PCR primers are the following: 



10 19506719_B-EXT Mat-Forw: 

GGATCCTCCAGGATTTTACTCAGAATGACGTTAGG (SEQ ID NO:58) 
19506719_B-EXT FL-Rev: 

CTCGAGCGGAGCATGCAGGCCTCTACTCATTAGGAACG (SEQ IDNO:59) 

1 5 PCR reactions were set up using a total of 5ng cDNA, consisting equal amounts of 

cDNA derived from human fetal brain, testis, skeletal muscle and mammary, template, 1 microM 
of each of the 1 95067 19_B-EXT Mat-Forw and 19506719_B-EXT FL-Rev primers, 5 
micromoles dNTP (Clontech Laboratories, Palo Alto CA) and 1 microliter of 50xAdvantage-HF 
2 polymerase (Clontech Laboratories, Palo Alto CA) in 50 microliter volume. The following 

20 reaction conditions were used: 
a) 96°C 3 minutes 

b) 96°C 30 seconds denaturation 

c) 60°C 30 seconds annealing 

d) 72°C 3 minute extension. 

25 Repeat steps b-d 35 times 

e) 72oC 10 minutes seconds final extension 



A single, 2.1 kb large, PCR product, was isolated from agarose gel and ligated to pCR2.1 
vector (Invitrogen, Carlsbad, CA). The cloned insert was sequenced, using vector specific, Ml 3 
30 Forward(-40) and M13 Reverse primers as well as the gene specific primers: 

GGACCATGCTAGACTTTCC, 19506719_B-EXT SI; (SEQ IDNO:60) 
GGAAAGTCTAGCATGGTCC, 1 95067 19_B-EXT S2; (SEQ ID NO:61) 
GGCTGAACITGGTGCTCTTCC, 1 95067 19_B-EXT S3; (SEQ ID NO:62) 
GGAAGAGCACCAAGTTCAGCC, 1950671 9JB-EXT S4; (SEQ IDNO:63) 
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GGCTTGCTGAACACCTACC, 1 95067 19_B-EXT S7; (SEQ ID NO:64) 
GGTAGGTGTTCAGCAAGCC, 1 95067 19_B-EXT S8; (SEQ ID NO:65) 
GCACAGACTGATTTTGCACG, 1 95067 19_B-EXT S9; (SEQ ID NO:66) and 
CGTGCAAAATCAGTCTGTGC, 1 95067 19_B-EXT SI 0 (SEQ ID NO:67). 



The disclosed novel MOL9b nucleic acid of 2061 nucleotides (also referred to as 
MOL9b) is shown in Table 9D. It is thought that MOL9b is an internal fragment of an open 
reading frame. Therefore its 5' and 3 s termini may be extended 



Table 9D. MOL9h Nucleotide Sequence (SEQ ID NO:23) 

TCCAGGATTTTACTCAGAATGACGTTAGGAAGAGAAGTGATGTCTCCTCTTCAGGCAATGTCTTCCTATAC 
TGTGGCTGGCAGAAATGTTTTAAGATGGGATCTTTCACCAGAGCAAATTAAAACAAGAACTGAGGAGCTCA 
TTGT G C AG ACCAAACAGGTGT ACG AT GCTGTT GG AAT GCTCGGTATTG AGGAAG T AACTTACGAGAACT GT 
CT GC AGGC ACTGGCAG ATGTAGAAGT AAAGTAT ATAGTGG AAAGGACC ATGCT AGACTTTC CCCAGCATGT 
ATCCTCTGACAAAGAAGTACGAGCAGCAAGTACAGAAGCAGACAAAAGACTTTCTCGTTTTGATATTGAGA 
TG AGC AT GAG AGGAGATAT ATTTGAGAGAATTGT T C ATT T ACAGGAAACCTGT GATCTGGGG AAGATAAAA 
CC TGAGGCCAGACGAT ACTTGG AAAAGTCAATT AAAATGGGGT^TAAGAAATGGGCTCCATCTTCCT GAACA 
AGTACAGAATGAAATCAAATCAATGAAGAAAAGAATGAGTGAGCTATGTATTGATTTTAACAAAAACCTCA 
ATGAGGATGATACCTTCCTTGTATTTTCCAAGGCTGAACTTGGTGCTCTTCCTGATGATTTCATTGACAGT 
TTAGAAAAGACAGATGATGACAAGTATAAAATTACCTTAAAATATCCACACTATTTCCCTGTCATGAAGAA 
AT GTT GT ATCCCTGAAACC AG AAG AAGGAT G G AAATG GCTT TTAAT AC AAGGT GCAAAG AGGAAAACACCA 
TAATTTTGCAGCAGCTACTCCCACTGCGAACCAAGGTGGCCAAACTACTCGGTTATAGCACACATGCTGAC 
TTCGTCCTTGAAATGAACACTGCAAAGAGCACAAGCCGCGTAACAGCCTTTCTAGATGATTTAAGCCAGAA 
GTTAAAACCCTTGGGTGAAGCAGAACGAGAGTTTA^TTTGAATTTGAAGAAAAAGGAATGCAAAGACAGGG 
GT TTT GAATATGAT GGGAAAATC AATGCCTG GG ATCT AT ATT ACT ACATGACT C AGAC AG AGGAACTCAAG 
TATTCCATAGACCAAGAGTTCCTCAAGGAATACTTCCCAATTGAGGTGGTCACTGAAGGCTTGCTGAACAC 
CTACCAGGAGTTGTTGGGACTTTCATTTGP-ACAAATGACAGATGCTCATGTTTGGAAC71AGAGTGTTACAC 
TTTATACTGTGAAGGATAAAGCTACAGGAGAAGTATTGGGACAGTTCTATTTGGACCTCTATCCAAGGGAA 
GGAAAATACAATCATGCGGCCTGCTTCGGTCTCCAGCCTGGCTGCCTTCTGCCTGATGGAAGCCGGATGAT 
GGCAGTGGCTGCCCTCGTGGTGAACTTCTCACAGCCAGTGGCAGGTCGTCCCTCTCTCCTGAGACACGACG 
AGGTGAGGACTTACTTTCATGAGTTTGGTCACGTGATGCATCAGATTTGTGCACAGACTGATTTTGCACGA 
TTTAGCGGAACAAATGTGGAAACrGACTTTGTAGAGGTGCCATCGCAAATGCTTGAAAATTGGGTGTGGGA 
CGTCG ATTCCCTCCGAAGATT GT C AAAAC ATT AT AAAG AT GGAAGCCCT ATTG CAG ACGAT CTG CTT GAAA 
AACTTGTTGCTTCTAGGCTGGTCAACACAGGTCTTCTGACCCrGCGCCAGATTGTTTTGAGCAAAGTTGAT 
CAGTCTCTTC^TACCAACACATCGCTGGATGCTGCyVAGTGAATATGCCAA^ 

AGTTGCACCTACTCCAGGCACAAATATGCCAGCTACCTTTGGAC^TTTGGCAGGGGGATACGATGGCCAAT 

ATT AT GGAT ATCTTTGG AGTG AAG T ATTTT CC ATGGAT ATGTTTT ACAGCTGT TTT AAAAAAGAAGGGATA 

ATGAATCCGGAGGTTGGAATGAAATACAGAAACCTAATCCTGAAACCTGGGGGATCTCTGGACGGCATGGA 

(^TGCTCCACAATTTCTTGAAACGTGAGCCJ^AACCAAAAAGCGTTCCTAATGAGTAGAGGCCTGCA 

CG ■ 



The MOL9b protein encoded by SEQ ID NO:21 has 687 amino acid residues, and is 
presented using the one-letter code in Table 9E (SEQ ID NO:24). 



Table 9E. Encoded MOL9b protein sequence (SEQ ID NO:24) 

SRILLIWTLGREVMSPLQAMSSYTVAGRNVLRWDLSPEQIKTRTEELIVQTKQVYDAVGMLGIEEVTYENC 
I^ALADVE\^IVERTMLDFPQHVSSDKEVRAASTEADKRLSRFDIEMSMRGDiraRI\n{LQETCDLGKIK 
PEARRYLEKSIKKGKRNGLHLPEQVQNEIKSMKKRMSELCIDFNKNLNEDDTFLVFSKAELGALPDDPIDS 
LEKTDDDKYKITLKYPHYFPVMKKCCIPETRRRWE^FNTRCKEENTIILQQLLPLRTKVAKIJ^YSTHAD 
FVLEMNTAKSTSRVTAFLDDLSQKLKPLGEAEREFILNLKKKECKDRGFEYDGKINAWDLYYYMTQTEELK 
YSIDQEFLKEYFPIEVVTEGLLNTYQELI/SLSFE^TDAHVVTOKSVTLYTVKDKATGEVLGQFYLDLYPRE 
GKYNHAAC FGLQPGCLLP DGS RMMAVAALWN FSQPVAGRPSLLRHDEVRTYFHEFGHVMHQICAQTDFAR 
FSGTm^ETDFVEVPSQMLENWVWDVDSLRRLSKHYKIXSSPIADDLl^KLVASRLWTGLLTI^QIVLSKV^ 
QS LHTNT S LDAAS E YAKYC S E I LGVAAT PGTNM P AT FGHLAGGY DGC YYG YLWS EV FSMDMFYS C FKKEG I 
MN PEVGMKYRNLILKPGGSLDGMDMLHNFLKREPWQKAFLMSRGLHAP 
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MOL9c 

In the present invention, the target sequence identified previously, Accession Number 
195067 19JB_EXT, was subjected to the exon linking process to confirm the sequence. PCR 
5 primers were designed by starting at the most upstream sequence available, for the forward 

primer, and at the most downstream sequence available for the reverse primer. In each case, the 
sequence was examined, walking inward from the respective termini toward the coding 
sequence, until a suitable sequence that is either unique or highly selective was encountered, or, 
in the case of the reverse primer, until the stop codon was reached. Such primers were designed 
1 0 based on in silico predictions for the full length cDNA, part (one or more exons) of the DNA or 
protein sequence of the target sequence, or by translated homology of the predicted exons to 
closely related human sequences sequences from other species. These primers were then 
employed in PCR amplification based on the following pool of human cDNAs: adrenal gland, 
bone marrow, brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia 
1 5 nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, 

kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 
gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus. Usually the resulting amplicons were gel purified, cloned and sequenced to high 
redundancy. The resulting sequences from all clones were assembled with themselves, with 
20 other fragments in CuraGen Corporation's database and with public ESTs. Fragments and ESTs 
were included as components for an assembly when the extent of their identity with another 
component of the assembly was at least 95% over 50 bp. In addition, sequence traces were 
evaluated manually and edited for corrections if appropriate. These procedures provide the 
sequence reported below, which is designated Accession Number CG56222-01 
25 The disclosed novel MOL9c nucleic acid of 21 67 nucleotides (also referred to as 

CG56222-01) is shown in Table 9F An open reading frame begins with an ATG initiation codon 
at nucleotides 16-1 8 and ends with a TGA codon at nucleotides 2128-2130. Aputative 
untranslated region upstream from the initiation codon and downstream from the termination 
codon are underlined in Table 9F, and the start and stop codons are in bold letters. 
30 In a search of sequence databases, it was found, for example, that the nucleic acid 

sequence has 2000 of 2167 bases (92%) identical to a gb:GENBANK- 
ID:AB000170|acc:AB000170.1 mRNA from Sus scrofa (Porcine mRNA for endopeptidase 
24.16, complete cds) (Expect = 0.0). 
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Table 9F. MOL9c Nucleotide Sequence (SEQ TO NO:25) 

C CTCTCAGCGCTCCCA TGATCGCCCGGTGCCTTTTGGCTGTGCGRftGCCTCCGCAGAGTTGGTGGTTCCAG 
GATTTTACTCAGAATGACGTTAGGAAGAGflAGTGATGTCTCCTCTTCAGGCAATGTCTTCCTATACTGTGG 
CTGGCAGAAATGTTTTAAGATGGGATCTTTCACCAGAGCAAATTAAAACAAGAACTGAGGAGCTCATTGTG 
C^GACCAAACAGGTGTACGATGCTGTTGGA.ATGCTCGGTATTGAGGAAGT^ 

GGCACTGGCAGATGTAGAAGTAAAGTATATAGTGGAAAGGACCATGCTAGACTTTCCCCAGCATGTATCCT 

CTGACAAAGAAGTACGAGCAGCAAGrACAGAAGCAGACAAAAGACTTTCTCGTrTTGATATTGAGATGAGC 

AT G AG AGG AGATATAT TT G AGAG AATTGTTCATTT AC AGG AAACCTG T G ATCTGGGGAAGAT AAAACCTG A 

GG CCAGAC GATAC TT GGAAAAGTCAATT AAAATGGGG AAAAG AAATGGGCTC C AT CTTCCT GAACAAGT AC 

AGAAT GAAATCAAATC AATGAAGAAAAGAATGAGTGAG CT ATGTATTGATTTT AACAAAAACCTCAATGAG 

GATGATAC CTTCCT TGTATTTTCCAAGGCTGAACTTGGTGCTCTTCCTGATGATTTCATTGACAGTTTAGA 

AAAG AC AG ATGATGACAAGTAT AAAATTACCTT AAAAT ATCCACACT AT TTCCC TGTCATGAAG AAATGTT 

GT ATCCCT G AAAC(^G AAGAAGG ATGGAAATGGCTTT TAAT ACAAGGTGC AAAGAGGAAAACACCAT AAT T 

TTGCAGCAGCTACTCCCACTGCGAACCAAGGTGGCCAAACTACTCGGTTATAGCACACATGCTGACfTCGT 

CCTTG AAATGAAC ACTGC AAAGAGCACAAGCCGCGT AACAGCCTT TCT AGATG ATTTAAGC CAG AAGTT AA 

AACCCTTGGGTGAAGCAGAACGAGAGlTTATTTTGAATTTGAAGAAAAAGGAATGCAAAGACAGGGGTTrT 

G AAT ATGATGGG AAAATC AAT GCC TGGGATCT AT AT T ACT ACATGACTCAGAC AGAGGAACTC AAGT ATTC 

C71TAGACCAAGAGTTCCTC^AGGAATACTTCCCAATTGAGGTGGTCACTGAAGGCTTGCTGAACACCTACC 

AGG AGTTGTTGGG ACTTT CATTTGAACAAAT GAC AGATGCT C ATGTTT GG AACAAGAGTGTT AC ACTTT AT 

ACTGTGAAGGATAAAGCTACAGGAGAAG^A'J'TGGGACAGTTCIATTTGGACCTCTATCCAAGGGAAGGAAA 

ATACAATCATGCGGCCTGCTTCGGTCTCCAGCCTGGCTGCCTTCTGCCTGATGGAAGCCGGATGATGGCAG 

TGGCTGCCCTCGTGGTGAACTTCTCACAGCCAGTGGCAGGTCGTCCCTCTCl'CCTGAGACACGACGAGGTG 

AGGACTTACTTTCATGAGTTTGGTCACGTGATGCATCAGATTTGTGCACAGACTGATTTTGCACGA1TTAG 

CGGAACAAATGTGGAAACTGACTTTGTAGAGGTGCCZATCGC7VAATGCTTGAAAATTGGGTGTGGGACGTCG 

ATTCCCTCOSAAGATTGTCAAAACATTATAAAGATGGAAGCCCTATTGCAGACGATCTGCTTGAAAAACTT 

GTTGCTTCTAGGCTGGTCAACACAGGTCTTCrGACCCTGCGCCAGATTGTTTTGAGCAA^ 

TCTTC AT ACCAAC ACATC GCT GGATGCTGCAAGTGAATATGCCAAATACT GCTCAGAAAT ATTAGG AGTTG 

C AGCT AC TCC AGGC ACAAAT AT GCCAGCT ACCT TT GG ACATTTGGCAGGGGGAT ACGATGG CCAAT ATT AT 

GGATATCTTTGGAGTGAAGTATTTTCCATGGATATGTTTTACAGCTGTTTTAAAAAAGAAGGGATAATGAA 

TCCGGAGGTTGGAATGAAATACAGAAACCTAATCCTGAAACCTGGGGGATCTCTGGACGGCATGGACATGC 

TCCACAATTTCTTGAAACGTGAGCCAAACCAAAAAGCGTTCCTAATGAGTAGAGGCCTGCATGCTCCGTGA 

ACTGGGGATCTTTGGTAGCCGTCCATGTCTGGAGGAC 



The MOL9c protein encoded by SEQ ID NO:25 has 703 amino acid residues, and is 
presented using the one-letter code in Table 9G (SEQ ID NO:26). The SignalP, Psort and/or 
Hydropathy profile for MOL9c predict that MOL9c has a signal peptide and is likely to be 
5 localized at the cytoplasm with a certainty of 0.9200. The SignalP predicts a cleavage site at the 
sequence between amino acids 17 and 18. 



Table 9G. Encoded MOL9c protein sequence (SEQ ID NO:26) 

MIARCLLAVRSLRRVGGSRILLRMTLGREVMSPLQA^ 
YDAVGMI^IEEVTYENCLC^UjADVEVKYIVERTMLDF 

FE RI VHLQET C DLGK I KP E ARRYLEKS I KMGKRNGLHLPEQVQNE I KSMKKRMSELC I D FN KNLNE DDT FL 

VFSKAELGALPDDFIDSLEKTDDDKYKITLKYPHYFPVMKKCCIPETRRRMEMAFNTRCKEENTIILQQLL 

PLRTKVAKLI^YSTHADFVLEbOTAKSTSRVTAFLDDI^QKLKPLGEAEREFILtn^KKKECKDRG 

INAWDLYYYKTQTEELKYS I DQEFLKE YFPIEWTEGLLNTYQELLGLS FEQMTDAH VHNKSVTLYTVKDK 

ATGEVLGQFYLDLYPREGKYNHAACFGLQPGCLLPDGSRt^VAALVVNFSQPWGRPSLLRHDEVRTYra 

EFGHVMHQICAQT DFARFSGTNVET DFVEVPSQMLENWVWDVDSLRR1S KH YKDGS P I ADDLLEKLVASRL 

VNTGIiTLRQIVLSKVDQSLHTNTSLDAASEYAKYCSEILGVA^^ 

EVFSMDMFYSCFKKEGIMNPEVGMKYRNLILKPGGSLDGMDMLHKFLKREPNQKAFLM5RGLHAP 



The full amino acid sequence of MOL9c was found to have 657 of 704 amino acid 

1 0 residues (93%) identical to, and 687 of 704 amino acid residues (97%) similar to, the 704 amino 

acid residue ptnr:SWISSPROT-ACC:P42675 protein from Oryctolagus cuniculus (Rabbit) 

(NEUROLYSIN PRECURSOR (EC 3.4.24.16) (NEUROTENSIN ENDOPEPTIDASE) 
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10 



15 



20 



(MITOCHONDRIAL OLIGOPEPTIDASE M) (MICROSOMAL ENDOPEPTIDASE) (MEP) 
(E value = 0.0) 

MOL9c is expressed in at least the following tissues: Artery, Brain, Bronchus, Cartilage, 
Cervix, Colon, Coronary Artery, Dermis, Epidermis, Foreskin, Heart, Kidney, Liver, Ovary, 
Pancreas, Pituitary Gland, Placenta, Prostate, Salivary Glands, Synovium/Synovial membrane, 
Thalamus, Umbilical Vein, Uterus. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, Public EST sources, Literature sources, and/or RACE sources. 

Possible SNPs found for MOL9c are listed in Table 9H. 



Table 9H: SNPs 


Consensus 
Position 


Depth 


Base 
Change 


399 


99 


A> G 


858 


51 


C> A 


863 


50 


T> A 


1242 


48 


T>C 


1810 


141 


G> A 


1824 


143 


T>C 


1892 


144 


T>C 



25 



These materials are further useful in the generation of antibodies that bind immuno- 
specifically to the novel MOL9c substances for use in therapeutic or diagnostic methods. These 
antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 
example the disclosed MOL9c protein has multiple hydrophilic regions, each of which can be 
used as an immunogen. In one embodiment, a contemplated MOL9c epitope is from about 
amino acids 25 to 75. In another embodiment, a MOL9c epitope is from about amino acids 100 
to 200. In further embodiments, MOL9a epitopes are found in amino acids 250-400, 450-550, 
and 650-700. These novel proteins can also be used to develop assay system for functional 
analysis. 

Homology between the MOL9 isoforms and other homologous proteins is presented 
graphically in the multiple sequence alignment given in Table 91 (with MOL9a being shown on 
line 1, MOL9b on line 2, and MOL9c on line 3) as a ClustalW analysis comparing MOL9 with 
related protein sequences. 
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Table 91. Information for the ClustalW proteins: 



10 



1) MOTj9a (SEQ ID NO: 22) 

2) MOL9b (SEQ ID NO: 24) 

3) MOL9c (SEQ ID NO: 26) 

4) SWISSNFW-ACC:Q0?038 NE13ROLYSIN PRECURSOR (SEQ ID NO: 68) 
3) SWISSPROT-ACC:P4 2675 NEOROLYSIN PRECURSOR (SEQ ID NO:69) 

5) SPTREMEL-ACC: P79433 ENDOPEP7IDASE 24.16 (SEQ ID NO: 70) 

6) SWISSPROT-ACC: P42676 NEUROLYSIN PRECURSOR (SEQ ID NO: 71) 

7) ACC:P47788 THIMST OLIGOPEPTIDASE (SEQ ID NO: 72) 
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P47788 245 

git "73053531 R}]DQ vgKMFFSKXPL 

gi | 392 78001 AflCKGlfifUJXQRAAWPGHAQNC 



Endopeptidase 24.16 or mitochondrial oligopeptidase, abbreviated here as EP 24.1 6 
(MOP), is a thiol- and metal-dependent oligopeptidase that is found in multiple intracellular 
compartments in mammalian cells. From an analysis of the corresponding gene, we found that 

5 the distribution of the enzyme to appropriate subcellular locations is achieved by the use of 

alternative sites for the initiation of transcription. The pig EP 24.16 (MOP) gene spans over 100 
kilobases and is organized into 16 exons. The core protein sequence is encoded by exons 5-16 
which match perfectly with exons 2-13 of the gene for endopeptidase 24.15, another member of 
the thimet oligopeptidase family. These two sets of 1 1 exons share the same splice sites, 

10 suggesting a common ancestor. Multiple species of rnRNA for EP 24.16 (MOP) were detected 
by the 5-rapid amplification of cDNA ends and they were shown to have been generated from a 
single gene by alternative choices of sites for the initiation of transcription and splicing. Two 
types of transcript were prepared, corresponding to transcription from distal and proximal sites. 
Their expression in vitro in COS-1 cells indicated that they encoded two isoforms (long and 

15 short) which differed only at their amino termini: the long form contained a cleavable 

mitochondrial targeting sequence and was directed to mitochondria; the short form, lacking such 
a signal sequence, remained in the cytosol. The complex structure of the EP 24.16 (MOP) gene 
thus allows, by alternative promoter usage, a fine transcriptional regulation of coordinate 
expression, in the different subcellular compartments, of the two isoforms arising from a single 

20 gene. PMID: 9182559, UI: 97326108 We have isolated a metallopeptidase from rat liver. The 
peptidase is primarily located in the mitochondrial ihtermembrane space, where it interacts non- 
covalently with the inner membrane. The enzyme hydrolyzes oligopeptides, the largest substrate 
molecule found being dynorphin Al-17; it has no action on proteins, and does not interact with 
alpha 2-macroglobulin, and can therefore be classified as an oligopeptidase. We term the enzyme 

25 oligopeptidase M. Oligopeptidase M acts similarly to thimet oligopeptidase (EC 3.4.24.15) on 
bradykinin and several other peptides, but hydrolyzes neurotensin exclusively at the -Pro+Tyr- 
bond (the symbol + is used to indicate a scissile peptide bond) rather than the -Arg+Arg- bond. 
The enzyme is inhibited by chelating agents and some thiol-blocking compounds, but differs 
from thimet oligopeptidase in not being activated by thiol compounds. The peptidase is inhibited 

30 by Pro-lie, unlike thimet oligopeptidase, and the two enzymes are separable in chromatography 

on hydroxyapatite. The N-tenninal amino acid sequence of rat mitochondrial oligopeptidase M 

contains 19 out of 20 residues identical with a segment of rabbit microsomal endopeptidase and 

1 7 matching the corresponding segment of pig-soluble angiotensin II-binding protein. Moreover, 
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the rat protein is recognized by a monoclonal antibody against rabbit soluble angiotensin H- 
binding protein, all of which is consistent with these proteins being species variants of a single 
protein that is a homologue of thimet oligopeptidase. The biochemical properties of the 
mitochondrial oligopeptidase leave us in no doubt that it is neurolysin (EC 3.4.24.16), for which 

5 no sequence has previously been reported, and which has not been thought to be mitochondrial. 
PMID: 7836437, UI: 95 1 38 1 7 1 We have isolated by immunological screening of a lambda 
ZAPII cDNA library constructed from rat brain mRNAs a cDNA clone encoding endopeptidase 
3.4.24.16. The longest open reading frame encodes a 704-amino acid protein with a theoretical 
molecular mass of 80,202 daltons and bears the consensus sequence of the zinc metalloprotease 

10 family. The sequence exhibits a 60.2% homology with those of another zinc metallopeptidase, 
endopeptidase 3.4.24.1*5. Northern blot analysis reveals two mRNA species of about 3 and 5 
kilobases in rat brain, ileum, kidney, and testis. We have transiently transfected COS-7 cells with 
pcDNA3 containing the cloned cDNA and established the overexpression of a 70-75-kDa 
immunoreactive protein. This protein hydroryzes QFS, a quenched fluorimetric substrate of 

15 endopeptidase 3.4.24.16, and cleaves neurotensin at a single peptide bond, leading to the 

formation of neurotensin (1-10) and neurotensin (1 1 -13). QFS and neurotensin hydrolysis are 
potently inhibited by the selective endopeptidase 3.4.24. 16 dipeptide blocker Pro-lie and by 
dithiothreitol, while the enzymatic activity remains unaffected by phosphoramidon and captopril, 
the specific inhibitors of endopeptidase 3.4.24.1 1 and angiotensin-converting enzyme, 

20 respectively. Altogether, these physicochemical, biochemical, and immunological properties 
unambiguously identify endopeptidase 3.4.24.16 as the protein encoded by the isolated cDNA 
clone. PMID: 7592986, UI: 96070836 A human genomic clone encompassing exons 1-3 of the 
neurotensin/neuromedin N gene was identified using a canine neurotensin complementary DNA 
probe. Sequence comparisons revealed that the 120-amino acid portion of the precursor sequence 

25 encoded by exons 1-3 is 89% identical to previously determined cow and dog sequences and that 
the proximal 250 bp of 5' flanking sequences are strikingly conserved between rat and human. 
The 5' flanking sequence contains cis-regulatory sites required for the induction of 
neurotensin/neuromedin N gene expression in PC 12 cells, including API sites and two cyclic 
adenosine-5-rnonophosphate response elements. Oligonucleotide probes based on the human 

30 sequence were used to examine the distribution of neurotensin/neuromedin N messenger RNA in 
the ventral mesencephalon of schizophrenics and age- and sex-matched controls. 
Neurotensin/neuromedin N messenger RNA was observed in ventral mesencephalic cells some 
of which also contained melanin pigment or tyrosine hydroxylase messenger RNA. Neurons 
expressing neurotensin/neuromedin N messenger RNA were observed in the ventral 
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mesencephalon of both schizophrenic and non-schizophrenic humans. PMID: 1436492, UI: 
93063858 Neurotensin is a small neuropeptide of 13 amino acids that may function as a 
neurotransmitter or neuromodulator in the central nervous system. In the CNS, neurotensin is 
localized to the catecholamine-containing neurons. A catecholamine-producing cell line can also 
5 produce NT. Lithium salts, widely used in the treatment of manic-depressive patients, 

dramatically potentiate NT gene expression in this cell line. Gerhard et al. (1989) used a canine 
cDNA as a probe on a somatic cell hybrid panel to determine that the human gene is located on 
chromosome 12. The tridecapeptide neurotensin (162650) is widely distributed in various 
regions of the brain and in peripheral tissues. In the brain, neurotensin acts as a neuromodulator, 

10 in particular of dopamine transmission in the nigrostriatal and mesocorticolirnbic systems, 
suggesting its possible implication in dopamine-associated behavioral neurodegenerative and 
neuropsychiatric disorders. Its various effects are mediated by specific membrane receptors. Vita 
et al. (1993) isolated a cDNA encoding the human neurotensin receptor and showed that it 
predicts a 418-amino acid protein that shares 84% homology with the rat protein. Le et al. (1997) 

1 5 also cloned the human neurotensin receptor (NTR) cDNA and its genomic DNA. The gene is 
encoded by 4 exons spanning more than 10 kb. The authors identified a highly polymorphic 
tetranucleotide Tepeat approximately 3 kb from the gene. Southern blot analysis revealed that the 
NTR gene is present in the human genome as a single-copy gene. Le et ial. (1997) stated that the 
neurotensin receptor has 7 transmembrane spanning regions and high homology to other 

20 receptors that couple to G proteins 

Neurolysin is expressed ubiquitously in the rat brain (Massarelli et al Brain Res 1999 
Dec 18; 85 1(1-2): 261-5; Dauch et al J Neurochem 1992 Nov; 59(5): 1862-7). It has been 
suggested that this enzyme plays a role in the regulation of neurologically active peptides 
(Vincent et al. Br J Pharmacol 1997 Jun; 121(4): 705-10) and activity differs depending on the 

25 cellular source of this enzyme whether it is expressed in primary cultured neurons and astrocytes 

(yincent et al J Neurosci 1996 Aug 15; 16(16): 5049-59). This might play a role in nociception 

and signal transduction in the brain as well as central nervous system. Related endopeptidases 

have been shown to play a role in processing angiotensin and important regulator of blood 

pressure. - 

30 Uses of the Compositions of the Invention 

The expression pattern, map location and protein similarity information for MOL9 

suggest that it may function as neurolysin family. Therefore, the nucleic acids and proteins of 
the invention are useful in potential therapeutic applications implicated, for example but not 
limited to, in various pathologies /disorders as described below and/or other 
35 pathologies/disorders. Potential therapeutic uses for the invention(s) are, for example but not 
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limited to, the following: (i) Protein therapeutic, (ii) small molecule drug target, (iii) antibody 
target (therapeutic, diagnostic, drug targeting/cytotoxic antibody), (iv) diagnostic and/or 
prognostic marker, (v) gene therapy (gene delivery/gene ablation), (vi) research tools, and (vii) 
tissue regeneration in vitro and in vivo (regeneration for all these tissues and cell types 

5 composing these tissues and cell types derived from these tissues).These may also function in 
extracellular matrix remodeling in tissues described above. 

The nucleic acids and proteins of the invention are useful in potential therapeutic 
applications implicated in various diseases and disorders described below and/or other 
pathologies and disorders. For example, but not limited to, a cDNA encoding the neurolysin - 

1 0 like protein may be useful in gene therapy, and the neurolysin -like protein may be useful when 
administered to a subject in need thereof. By way of nonlimiting example, the compositions of 
the present invention will have efficacy for treatment of patients suffering from Cancer, Trauma, 
Viral/bacterial/parasitic infections, Cardiomyopathy, Atherosclerosis, Hypertension, Congenital 
heart defects, Aortic stenosis, Atrial septal defect (ASD), Atrioventricular (A-V) canal defect, 

15 Ductus arteriosus, Pulmonary stenosis, Subaortic stenosis, Ventricular septal defect (VSD), valve 
diseases, Tuberous sclerosis, Scleroderma, Obesity, Transplantation, Atherosclerosis, 
Aneurysms, Hypertension, Fibromuscular dysplasia, Stroke, Scleroderma, Fertility, Diabetes, 
Vein Hippel-Lindau (VHL) syndrome , Pancreatitis, Hirschsprung's disease , Crohn's Disease, 
Appendicitis, Alzheimer's disease, Stroke, Hypcrcalceimia, Parkinson's disease, Huntington's 

20 disease, Cerebral palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia- 
telangiectasia, Leukodystrophies, Behavioral disorders, Addiction, Anxiety, Pain, Systemic 
lupus erythematosus , Autoimmune disease, Asthma, Emphysema, Scleroderma, Autoimmune 
disease, Renal artery stenosis, Interstitial nephritis, Glomerulonephritis, Polycystic kidney 
disease, Renal tubular acidosis, IgA nephropathy, Hypercalceimia, Lesch-Nyhan syndrome. The 

25 novel nucleic acid encoding the neurolysin-like protein, and the neurolysin -like protein of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. These materials are 
further useful in the generation of antibodies that bind immunospecifically to the novel 
substances of the invention for use in therapeutic or diagnostic methods 

30 

MOL10 

MOLlOa 

A novel nucleic acid encoding a protein bearing sequence similarity to Cyclic- 
Nucleotide-Gated Olfactory Channel -like protein was identified by TblastN using CuraGen 
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Corporation's sequence file for MOL10 probe or homolog, run against the Genomic Daily Files 
made available by GenBank. The nucleic acid was further predicted by the program GenScan™, 
including selection of exons. These were further modified by means of similarities using 
BLAST searches. The sequences were then manually corrected for apparent inconsistencies, 
5 thereby obtaining the sequences encoding the full-length protein. The disclosed novel MOLlOa 
nucleic acid of 1 835 nucleotides (also referred to as GM9 896064 7_A) is shown in Table 10A. 
An open reading frame begins with an ATG initiation codon at nucleotides 54-56 and ends with 
a TGA codon at nucleotides 1788-1790. A putative untranslated region upstream from the 
initiation codon and downstream from the termination codon are underlined in Table 10A, and 
10 the start and stop codons are in bold letters. 

The nucleic acid sequence has 1 536 of 1733 bases (88%) identical to a Rattus norvegicus 
Cyclic-Nucleotide-Gated Olfactory Channel ocnc2 mRNA (GENBANK-ID: U12623) (Expect = 
5.2e 108 ). 



Table 10A. MOLlOa Nucleotide Sequence (SEQ ID NO:27) 

TAC AG GCAG AG AGGGT GT GGACATCT C^CACCCC AGCACC AG ACCACAGAACCA TGAGCCAGGACACC AAA 

GTGAAGACAACAGAGTCCAGTCCCCCAGCCCCATCCAAGGCCAGGAGGAAGTTGCTGCCTGTCCTGGACCC 

ATCTGGGG AT T ACTACT AC7GGTGGCTGAACACAATGGT CTTCCCAGT CATGT ATAACCTCATCAT CCTCG 

TGTGCAGGGCCTGCTTCCCCGACTTGCAGCACGGTTATCTGGTGGCCTGGTTGGTGCTGGACTACACGAGT 

GACCTGCTATACCTACTAGACATGGTGGTGCGCTTCCACACAGGTGGATTCTTGGAACAGGGCATCCTGGT 

GGTGGACAAGGGTAGGATCTCGAGTCGCTACGTTCGCACCTGGAGTTTCTTCTTGGACCTGGCTTCCCTGA 

TGCCCACAGATGTGGTCTACGTGCGGCTGGGCCCGCACACACCCACCCTGAGGCTGAACCGCTTTCTCCGC 

GCGCCCCGCCTCTTCGAGGCCTTCGACCGCACAGAGACCCGCACAGCTTACCCAAATGCCTTTCGCATTGC 

C^GCTGATGCTTTACATTTTTGTCGTCATCCATTGGAACAGCTGCCTATACTTTGCCCTATCCCGGTACC 

TGGGCTTCGGGCGTGACGCATGGGTGTACCCGGACCCCGCGCAGCCTGGCTTTGAGCGCCTGCGGCGCCAG 

TACCT CTAT AGCT TT T AC TTCTCC ACGCTGAT ACTGACT ACAGTGGGCGATACACCGCCGCCAG CCAGGGA 

AGAAGAGTACCTCTTCATGGTGGGCGACTTCCTGCTGGCCGTCATGGGTTTCGCCACCATCATGGGTAGCA 

•TGAGC TCTGTCAT CT AC AACATGAACAC TGCAG ATGCGGC T TTCT ACCC AG ATCATGCACT GGT GAAGAAG 

TAC AT GAAGCTGC AGC AC GTC AAC CGCAAGCTGGAGCGGCGAGTT ATT G ACTGGTATCAGC ACCTGCAGAT 

CAACAAGAAGATGACCAACGAGGTAGCCATCTTACAGCACTTGCCTGAGCGGCTGCGGGCAGAAGTGGCTG 

TGTCTGTGCACCTGTCCACTCTGAGCCGGGTGCAGATCTTTPU^CTGTGAGGC(^CCTGCTGGAGGAG ■ 

CTGGTGCTGAAGCTGCAGCCCCAGACCTACTCACCAGGTGAATATGTATGCCGCAAAGGAGACATTGGCCA 

AG AGATGT ACATC ATCCGAGAGGGTCAACTG GCCGTGGTGGC AGATGATG GT ATCACACAGT AT GCTGTGC 

TCGGTGCAGGGCTCTACTTTGGGGAGATCAGCATCATCAACATCAAAGGTGGGAACATGTCTGGGAACC^ 

CGC AC AGCGAACATCAAG AGC CT AGGTT ATT CAG ACCT AT T CTGCCT G AG CAAGG AGGAC CT GC GGGAGGT 

GCTGAGCGAGTATCCACAAGCACAGACCATCATGGAGGAGAAAGGACGTGAGATCCTGCTGAAAATGAACA 

AGTTGGACGTGAATGCTGAGGCAGCTGAGATCGCCCTGCAGGAGGCCACAGAGTCCCGGCTACGAGGCCTA 

GACC^GCAGCTGGATGATCTACAGACCAAGTTTGCTCGCCTCCTC 

GATTGCTT ACCGCATT GAACGGCT GGAGTGGC AG ACTCGAGAGTGGCCAATGCCCG AGGAC CTGGCTGAGG 
CTG AT CACGAGGGTG AG CCTG AGGAGGGAACTTCCAAAGATG AAGAGGGCAGGGCCAGCCAGGAGGGACCC 
CCAGGTCCAGAGTGACCCCATCCCCATCCCCAGGATTCCCACCTCCTAGTGAATCCAGAG 

15 

The MOLlOa protein encoded by SEQ ID NO:27 has 638 amino acid residues, and is 
presented using the one-letter code in Table 10B (SEQ ID NO:28). PSORT analysis predicts the 
protein of the invention to be localized in the plasma membrane with a certainty of 0. 6000. 
Using the SIGNALP analysis, it is predicted that the protein of the invention has a signal peptide 
20 with most likely cleavage site between positions 57 and 58. 
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Table 10B. Encoded MOLlOa protein sequence (SEQ ID NO:28) 

MSQDTKVKTTESSPPAPSKARRKLLPVLDPSGDYYY\TOI^ITtWFPVMYNLIILVCEU\CFPDl^HGYLVA^ 
VLDYT S DLLYLLDMWRFHTGG FLEQG I LWDKGRIS SRYVRTWS FFL DliASLMPT DWYVRLGPHT PTLR 
LNRFLRAPRLFEAFDRTETRTAYPNAFRIAKLMLYIFWIHWNSCLYFALSRYLGFGRDAWVYPDPAQPGF 
ERLRRQYLYSFYFSTLILTTVGDTPPPAREEEYLFMVGDFLLAVMGFATIMGSMSSVIYNMNTADAAFYPD 
HALVKKYMKLQHVNRKLERRVIDWYQHLQINKKMTNEVAILQHLPERLRAEVAVSVHLSTLSRVQIFQNCE 
AS LLEELVLKLQPQT YS PGE YVCRKGDI GQEMY 1 1 REGQLAWADDG I TQYAVLGAGLYFG EI S I IK IKGG 
NM SGN RRTAN IKS LGYS DLFCLS KEDLRE VLS E Y PQAQTIMEEKGRE I LLKMNKLDVN AEAAE I ALQEATE 
SRLRGLDQC>IiDDLQTKFARLIJ^I£SSALKIAYRIERLSWQTREWPMPEDIiAEADDEGEPEEGTSKDEEGR 
ASQEGPPGPE 

The full amino acid sequence of the protein of the invention was found to have 1068 of 
1649 amino acid residues (64%) identical to, and 1068 of 1649 residues (64%) positive with, the 
575 amino acid residue Cyclic-Nucleotide-Gated Olfactory Channel ocnc2 subunit protein from 
5 Rattus fion'egicus (ptnr:SPTREMBL-ACC: Q64359) (E value - S.Se- 54 ), and .292 of 556 amino 
acid residues (52%) identical to, and 404 of 556 residues (72%) positive with, the 694 amino 
acid residue Cone Photoreceptor cGMP-Gated Channel Alpha Subunit Homo sapiens (Human) 
(ptnr: TREMBLNEW -ACC: AAC 17440) (E value = 5.8e- 157 ) 
MOLlOb 

10 In the present invention, the target sequence identified previously, MOL1 Oa Accession 

Number GM98960647_A (also known as CG54557-01), was subjected to the exon linking 
process to confirm the sequence. PCR primers were designed by starting at the most upstream 
sequence available, for the forward primer, and at the most downstream sequence available for 
the reverse primer. In each case, the sequence was examined, walking inward from the 

15 respective termini toward the coding sequence, until a suitable sequence that is either unique or 
highly selective was encountered, or, in the case of the reverse primer, until the stop codon was 
reached. Such primers were designed based on in silico predictions for the full length cDNA, 
part (one or more exons) of the DNA or protein sequence of the target sequence, or by translated 
homology of the predicted exons to closely related human sequences sequences from other 

20 species. These primers were then employed in PCR amplification based on the following pool of 
human cDNAs: adrenal gland, bone marrow, brain - amygdala, brain - cerebellum, brain - 
hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, 
fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland, pancreas, pituitary gland, 
placenta, prostate, salivary gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, 

25 testis, thyroid, trachea, uterus. Usually the resulting amplicons were gel purified, cloned and 
sequenced to high redundancy. The resulting sequences from all clones were assembled with 
themselves, with other fragments in CuraGen Corporation's database and with public ESTs. 
Fragments and ESTs were included as components for an assembly when the extent of then- 
identity with another component of the assembly was at least 95% over 50 bp. In addition, 
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sequence traces were evaluated manually and edited for corrections if appropriate. These 
procedures provide the sequence reported below, which is designated MOLlOb (Accession 
Number CG54557-02). This differs from the previously identified sequence [GM98960647_A ) 
(also_known as CG54557-01)] at aminoacid position 1 59 T->A and has dekjions at positions 22/ 
5 R, 93 G and 426 G. ) 

The disclosed novel MOLlOb nucleic acid of 2551 nucleotides (also referred to as 
CG54557-02) is shown in Table 9D. An open reading frame begins with an ATG initiation 
codon at nucleotides 779-781 and ends with a TGA codon at nucleotides 2504-2506. A putative 
untranslated region upstream from the initiation codon and downstream from the termination 
1 0 codon are underlined in Table 1 OA, and the start and stop codons are in bold letters. 

The nucleic acid sequence has 1 1625 of 1 857 bases (87%) identical to a gb:GENBANK- 
E):RNU12425|acc:U12425.1 mKNA from Rattus norvegicus (Rattus norvegicus olfactory cyclic 
nucleotide-gated channel mRNA, complete cds) (Expect = 4.0e" 3 16 ). 



Table 10D. MOLlOb Nucleotide Sequence (SEQ ID NO:29) 

GTTTTTGTTGTTTTGATATAGGAGRTATTGARGCAGGTTCACAAAARGAGARAAGTTGAAAGATTGGGGAC 

CATAAAACACATGGAATGGTTGGTAGGATCAGGCACTAGAAGTCACAAGAAGGATATGAGGACAAAAGCAC 

CATAGGATGGCCCCTATCACACTACCTATGAGAATGGTGTGATGGGGGAAGGCGTATGTGGAGGTAGATAA 

GGGTAGGAAGTAGGTTACAAAAATAGAGCTCACTTCTCATGTGAGAGGCATCTCTTTGTCCCTGGAGAATA 

GTTTAGCACCTGACATAGATAAGCCATTCAGTAATAGTTGTTAAATAAATAAATAGTGAGGCCCAAATAGA 

AT T TG C AAAG AT AAAAC AG AGT G TTTG ATCC T AC ACT AAAACTG AGGT CTT CTGACCCAG AGGACACCT AT 

GTAGCTCAGTTGCTGTGGAAAGAGGGGAGGAGGAAAACAGAGACAAGACTCAGGCTTCCCTCTGAGGCATG 

CACCCCCACCTTCTCCAGGGATCTCATTAGAGGTGTTTAGCTGGGCAGGTGTAAGCCCAGGCCCTGGGAGA 

CAGGGCAGAGTGCTAGAGCTAGACTGTCTCCACCCCTTCAGTAGCGCTAGCTCTGGTTGTGTTGCTAAGAG 

CCCCAAAGACAAAGAAGTCACAGCAGAAGCCCAACAGCAGCCTCCTTCAGGCAGTCAGGCACTAGTGCCCA 

AC TCC AG AAGTCC CCT AC AGG C AGAG AGGGTGTGG AC ATCTCACACC C CAGC ACCAGRCCACAG AACCA TG 

AGCCAGGAC^CC^U^GTGAAGACAACAGAGTCCAGTCCCCCAGCCCCATCC^UVGGCCAGGAAGTTGCTGCC 

TGTCCTGGACCCATCTGGGGATTACTACTACTGGTGGCTGAACACAATGGTCTTCCCAGTCATGTATAACC 

TCATCATCCTCGTGTGCAGAGCCTGCTTCCCCGAOTGCAGCACGGTTATCTGGTGGCCTGGTTGGTGCTG 

GACTAC^CGAGTGACCTGCTATACCTACTAGACATGGTGGTGCGCTTC(^CA(^GGATTCTTGGAA(^GGG 

CATCCTGGTGGTGGACAAGGGTAGGATCTCGAGTCGCTACGTTCGCACCTGGAGTTTCTTCTO 

CTTCCCTGATGCCCACAGATGTGGTCTACGTGCGGCTGGGCCCGCACACACCCACCCTGAGGCTGAACCGC 

TTTCTCCGCGCGCCCCGCCTCTTCGAGGCCTTCGACCGCGCAGAGACCCGCACAGCTTACCCAAATGCCTT 

TCGCATTGCC^^GCTGATGCTTTACATTTTTGTCGTCyVTCCA 

CCCGGTACCTGGGCTTCGGGCGTGACGCATGGGTGTACCCGGACCCCGCGCAGCCTGGCTTTGAGCGCCTG 
CGGCGCC AGT ACCTCT AT AGCTT TTACT TCTCCACGCTGATACTG ACT ACAGTGGGCGATACACCGCCGCC 
AGCC^GGGAAGAAGAGTACCTCTT<^TGGTGGGCGACTTCCTGCTGGCCGTCATGGGTTTCGCCACC^^CA 
T G GGT AGC ATGAGCTCTGTCATCT ACAACATGAACACTGCAGATGCGGCTTTCTACCCAGATCATGCACTG 
GTGAAGAAGTAC^TGAAGCTGCAGC^CGTCAACCGCAAGCTGGAGCGGCGAGTTATTGACTGGTATCAG 
CCTGCAGATCAACAAGAAGATGAC(^CC^GGTAGCC^TCrTAC^GC^CTTGCCTGAGCGGCTGCGGGCAG 
AAGTGGCTGTGTCTGTGCACCTGTCCACTCTGAGCCGGGTGCAGATCTTTCAGAACTGTGAGGCCAGCCTG 
CTGGAGG AGCTGGTGCT G AAG CTGCAGCCCCAGACCTACTCACCAGGTGAATATGTAT GCCGCAAAGGAGA 
CATTGGCCAAGAGATGT ACAT CATCCGAGAGG GTCAACTGGCCGTGGTGG CAG ATGATGGT ATCAC ACAGT 
ATGCTGTGCTCGGTGCAGGGCTCTACTTTGGG G AGATCAGCATCATCAACATC AAAGGGAACAT GTCTGGG 
AACCG CCGCACAGCC AACAT C AAG AGCCT AGGTTATTCAG ACCTATTCTGCCT GAGCAAGG AGGACCTGCG 
GGAGGTGCTGAGCGAGTATCCACAAGCACAGACCATCATGGAGGAGAAAGGACGTGAGATCCTGCTGAAAA 
TGAACAAGTTGGACGTGAATGCrGAGGCAGCTGAGATCGCCCTGCAGGAGGCCACAGAGTCCCGGCTACGA 
GGCCT AGACCAGCAGCTGGAT GATCTACAGACCAAGTTTGCTCGCCTCCT GGCTGAGCTGGAGTCCAGCGC 
ACTT AAG ATTGCTT ACCGCAT TGAACGGCTGG AGTGGCAGACTCGAGAGT GGCCAATGCCCGAGGACCTGG 
CTGAGGCTGATGACGAGGGTGAGCCTGAGGAGGGAACTTCC^^GATGAAGAGGGC^GGGCCAGCC^GGAG 
GGACCCCCAGGTCCAGAGTGACCCCATCCCCATCCCCAGGATTCCCACCTCCTAGTGAATCCAGAG 

15 
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The MOL9a protein encoded by SEQ ID NO:29 has 575 amino acid residues, and is 
presented using the one-letter code in Table 10B (SEQ ID NO:30). PSORT analysis predicts the 
protein of the invention to be localized in the plasma membrane with a certainty of 0. 6000. 
Using the SIGNALP analysis, it is predicted that the protein of the invention has a signal peptide 
5 with most likely cleavage site between positions 56 and 57 (CRA-CF) 



Tabic 10E. Encoded MOLlOb protein sequence (SEQ ID NO:30) 

.LDYTSDLLYI^DMWRFHTGFI^^ILWDKGRIS^^ 
RFLRAPIUJFEAFDRAETRTAYPN&FRIAKLMLYIF 

LRRQYLYSFYFSTLILTTVGDTPPPAREEEYLFMVGDFIJ^VMGFATIMGSMSSVIYNMNTADAAFYPDHA 
LVKKYMKLQHVNRKLERRVI DWYQHLQI K KKMTNEVAI LQHLPERLRAEVAVS VHLSTLSRVQI FQNCEAS 
LLEELVLKLQPQTYSPGEYVCRKGDIGQEMYIIREGQIJVWADDGITQYAVLGAGLYFGEISIIKIKGNMS 
GNRRTANIKSl^YSDLFCLSKEDLREVLSEYPQAQTIMEEKGREILLKMNKLDVNAEAAEIALQEATESRL 
RGLDQQLDDLQTKFARLLAELESSALKIAYRIERLEWQTREWPHPEDLAEADDEGEPEEGTSKDEEGRASQ 
EGPPGPE 

The full amino acid sequence of the protein of the invention was found to have 536 of 
575 amino acid residues (93%) identical to, and 552 of 575 amino acid residues (96%) similar to, 
10 the 575 amino acid residue ptnr:SWISSPROT-ACC:Q64359 protein from Rattus norvegicus 
(Rat) (CYCLIC-NUCLEOTIDE-GATED OLFACTORY CHANNEL OCNC2 SUBUNIT) (E 
value = 4.2e- 287 ) 

Chromosomal information: 

The Cyclic-nucleotide gated olfactory chaimebpchc&disclosed in this invention maps to 
1 5 chromosome 1 1 . This information was assigned using OMIM, the electronic northern 
bioinformatic tool implemented by CuraGen Corporation, public ESTs, public literature 
references and/or genomic clone homologies. This was executed to derive the chromosomal 
mapping of the SeqCalling assemblies, Genomic clones, literature references and/or EST 
sequences that were included in the invention. 

20 Tissue expression 

The Cyclic-nucleotide gated olfactory channel ocnc2 disclosed in this invention is 
expressed in at least the following tissues: Adrenal gland, bone marrow, brain - amygdala, brain 
- cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal 
brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland, 

25 pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, small intestine, 
spinal cord, spleen, stomach, testis, thyroid, trachea, uterus. . This information was derived by 
detennining the tissue sources of the sequences that were included in the invention including but 
not limited to SeqCalling sources, Public EST sources, and/or RACE sources. 
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In addition, the sequence is predicted to be expressed in the following tissues because of 
the expression pattern of (GENBANK-1D: gb:GENBAWK-lD:RNU12425|acc:U12425.1) a 
closely related Rattus norvegicus olfactory cyclic nucleotide-gated channel mRNA, complete cds 
homolog in species Rattus norvegicus : olfactory neuroepithelium. 

5 These materials are further useful in the generation of antibodies that bind immuno- 

specifically to the novel MOLlOb substances for use in therapeutic or diagnostic methods. 
These antibodies may be generated according to methods known in the art, using prediction from 
hydrophobicity charts, as described in the "Anti-MOLX Antibodies" section below. For 
example the disclosed MOLlOb protein has multiple hydrophilic regions, each of which can be 

1 0 used as an immunogen. In one embodiment, a contemplated MOL1 Ob epitope is from about 

amino acids 25 to 75. In another embodiment, a MOLlOb epitope is from about amino acids 1 to 
30. In further embodiments, MOLlOb epitopes are found in amino acids 150-250, 275-350, 375- 
400. and 425-560. These novel proteins can also be used to develop assay system for functional 
analysis. 

1 5 Homology between the MOL1 0 isoforms and other homologous proteins is presented 

graphically in the multiple sequence alignment given in Table 91 (with MOLlOa being shown on 
line 1 and MOLlOb on line 2) as a ClustalW analysis comparing MOL10 with related protein 
sequences. 



20 



25 



Table Information for the ClustalW proteins: 

1) MOLlOa (SEQ ID N0:28) 

2) MOLlOb (SEQ ID NO: 30) 

3) S35691 cyclic nucleotide-gated channel protein - rabbit (SEQ ID NO: 73) 

4) Q64359 Cyclic-Nucleotide-Gated Olfactory Channel ocnc2 subunit protein from. 
-Rattus norvegicus (SEQ IT) NO: 74). 

5) AAC17440 Cone Photoreceptor cGMPrGated Channel Alpha Subunit Homo sapiens 
(SEQ ID NO:75) 
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MOLlOa Pro 
MOLlOb Pro 
S35691 
Q64359 
AAC17440 



MOLlOa Pro 
MOLlOb Pro 
S3S691 
Q64359 
AAC17440 



MOLlOa Pro 
MOLlOb Pro 
£35691 
Q64359 
AAC17440 



MSSWRSCARA PLSGSAWRRS AATRRSRRCL KTKRKRWSSG KGTPMQSTQC ETRRRAQTPC 60 
MAKIHTQYSH P-S RTH LKVKTSDRDL NRAENGLSRA HSS SEET SSVLQPGIAM 52 
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RLAEMDAPQQ RRGGFRRIVR LVGVIRQWAN RN FREE EAR P DSFgERFRGP ELQTVTTQOG 179 
EBSSQBjjNAQ AN VGSQeEaDKG KSAwfi -LAKCBTNTS 144 
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Q64359 b^GkhM^AH Uc UdikW ^GEl EKBGRAErasE iH 575 
AAC17440 pB^DGEVPEP. ATKTEDgQQ 694 



Cyclic nucleotide-gated (CNG) channels play central roles in visual and olfactory signal 
transduction. In the retina, rod photoreceptors express the subunits CNCalphal and CNCbetala. 
In cone photoreceptors, only CNCalpha2 expression has been demonstrated so far. Rat olfactory 
5 sensory neurons (OSNs) express two homologous subunits, here designated CNCalpha3 and 
CNCalpha4. This paper describes the characterization of CNCbetalb, a third subunit expressed 
in OSNs and establishes it as a component of the native channel. CNCbetalb is an alternate 
splice form of the rod photoreceptor CNCbetala subunit. Analysis of niRNA and protein 
expression together suggest co-expression of all three subunits in sensory cilia of OSNs. From 
1 0 single-channel analyses of native rat olfactory channels and of channels expressed 

heterologously from all possible combinations of the CNCalpha3, -alpha4, and -beta lb subunits, 
we conclude that the native CNG channel in OSNs is composed of all three subunits. Thus, CNG 
channels in both rod photoreceptors and olfactory sensory neurons result from coassembly of 
specific alpha subunits with various forms of an alternatively spliced beta subunit 
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Phototransduction is mediated by an enzymatic cascade that ultimately leads to the 
hydrolysis of cGMP. The photoreceptor cells, rods and cones, integrate and respond to cGMP 
hydrolysis via a cGMP-gated cation channel in the plasma membrane of the outer segment. 
Kaupp et al. (1989) cloned this channel from bovine retina. Dhallan et al. (1991) used the bovine 
5 sequence to isolate cDNA and genomic DNA encompassing the entire protein coding region of 
the human homolog. Assignment to chromosome 4 was achieved by study of somatic cell 
hybrids. Pittler et al. (1992) determined the primary structures of the human and mouse retinal 
rod cGMP-gated cation channel by analysis of cDNA clones and amplified DNA. The open 
reading frames predicted polypeptides of 690 and 683 residues, respectively, exhibiting 88% 

10 sequence similarity. Significant sequence similarity (59%) of the visual cGMP-gated channel to 
the olfactory cAMP-gated channel was pointed out The RNA transcript was found to be 3.2 kb 
long in human, mouse, and dog. By PCR used in connection with somatic cell hybrid DNAs, 
Pittler et al. (1992) mapped the CNCG gene to 4pl4-ql3 near the centromere. By interspecific 
backcross haplolype analysis, the corresponding gene in the mouse, Cncg, was mapped to a site 

15 0.9 cM proximal to the Kit locus on chromosome 5. Griffin et al. (1993) mapped the CNCG1 
gene to 4pl2-cen by fluorescence in situ hybridization. It is noteworthy that the rod cGMP PDE 
beta polypeptide (PDEB; 180072) also maps to 4p, at 4pl6.3. 
Uses of the Compositions of the Invention 

The protein similarity information, expression pattern, and map location for MOL10 

20 suggest that it may have important structural and/or physiological functions characteristic of the 
Cyclic-nucleotide gated channel family. Therefore, the nucleic acids and proteins of the 
invention are useful in potential diagnostic and therapeutic applications and as a research tool. 
These include serving as a specific or selective nucleic acid or protein diagnostic and/or 
prognostic marker, wherein the presence or amount of the nucleic acid or the protein are to be 

25 assessed, as well as potential therapeutic applications such as the following: (i) a protein 

therapeutic, (ii) a small molecule drug target, (iii) an antibody target (therapeutic, diagnostic, 
drug targeting/cytotoxic antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene 
ablation), and (v) a composition promoting tissue regeneration in vitro and in vivo (vi) biological 
defense weapon. 

30 The nucleic acids and proteins of the invention are useful in potential diagnostic and 

therapeutic applications implicated in various diseases and disorders described below and/or 
other pathologies. For example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from: 
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color blindness, CNS developmental disorders and other diseases, disorders and 
conditions of the like. 

These materials are further useful in the generation of antibodies that bind 
immunospecifically to the novel substances of the invention for use in therapeutic or diagnostic 
5 methods. 

A summary of the MOLX nucleic acids and proteins of the invention is provided in Table 

11. 



TABLE 11: Summary Of Nucleic Acids And Proteins Of The Invention 



Name 


Tables 


fin no* T)f*cprlntlnn nf Hnmnlno 


Nucleic 
Acid 

SEQ ID 
NO 


A m in n 

AlUIUU 

Acid 
SEQ ID 
NO 


MOLl 


1A, IB, 


MOLl : GM_79960 1 78 


1 


2 


MOL2 


2A, 2B 


MOL2: 20466828_EXT1 


3 


4 


MOL3 


3A,3B 


MOO: 82254077.0.1 


5 


6 


MOL4 


4A, 4B, 


MOL4: AC004826 


7 


8 


MOL5 


5A, 5B, 


MOL5: AC025535 


9' 


10 


MOL6 


6A.6B 


MOL6a: OM_87760758_A 


11 


12 




6D,6E 


MOI^b: GM 87760758 A da 


13 


14 


MOL7 


7 A, 7B 


MOL7: 30675745.0.499 


15 


16 


MOL8 


8A, 8B 


MOL8a: 11800699-0-16 


17 


18 




8D.8E 


MOL8b: CG56222-01 


19 


20 


MOL9 


9A.9B 


MOL9a: 19506719_B_EXT 


21 


22 




9D.9E 


MOL9b: 1 950671 9_B_EXT-S773 


23 


24 




9F, 9G 


MOL9c: CG56222-01 


25 


26 


MOL10 


10A, 10B 


MOLlOa GM98960647_A 


27 


28 . 




10D, 10E 


MOLlOb CG54557-02 


29 


30 



MOLX Nucleic Acids and Polypeptides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
MOLX polypeptides or biologically active portions thereof. Also included in the invention are 
nucleic acid fragments sufficient for use as hybridization probes to identify MOLX-encoding 
nucleic acids (e.g. 9 MOLX mRNAs) and fragments for use as PCR primers for the amplification 
and/or mutation of MOLX nucleic acid molecules. As used herein, the term ''nucleic acid 
molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA), RNA 
molecules (e.g. , mRNA), analogs of the DNA or RNA generated using nucleotide analogs, and 
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derivatives, fragments and homologs thereof. The nucleic acid molecule may be single-stranded 
or double-stranded, but preferably is comprised double-stranded DNA. 

An MOLX nucleic acid can encode a mature MOLX polypeptide. As used herein, a 
"mature" form of a polypeptide or protein disclosed in the present invention is the product of a 
5 naturally occurring polypeptide or precursor form or proprotein. The naturally occurring 
polypeptide, precursor or proprotein includes, by way of nonlimiting example, the full-length 
gene product, encoded by the corresponding gene. Alternatively, it may be defined as the 
polypeptide, precursor or proprotein encoded by an ORF described herein. The product 
"mature** form arises, again by way of nonlimiting example, as a result of one or more naturally 

10 occurring processing steps as they may take place within the cell, or host cell, in which the gene 
product arises. Examples of such processing steps leading to a "mature" form of a polypeptide 
of protein include the cleavage of the N-terminal methionine residue encoded by the initiation 
codon of an ORF, or the proteolytic cleavage of a signal peptide or leader sequence. Thus a 
mature form arising from a precursor polypeptide or protein that has residues 1 to N, where 

1 5 residue 1 is the N-terminal methionine, would have residues 2 through N remaining after 

removal of the N-terminal methionine. Alternatively, a mature form arising from a precursor 
polypeptide or protein having residues 1 to N, in which an N-terminal signal sequence from 
residue 1 to residue M is cleaved, would have the residues from residue M-H to residue N 
remaining. Further as used herein, a "mature" form of a polypeptide or protein may arise from a 

20 step of post-translational modification other than a proteolytic cleavage event. Such additional 
processes include, by way of non-limiting example, glycosylation, myristoylation or 
phosphorylation. In general, a mature polypeptide or protein may result from the operation of 
only one of these processes, or a combination of any of them. 

The term "probes", as utilized herein, refers to nucleic acid sequences of variable length, 

25 preferably between at least about 10 nucleotides (nt), 100 nt, or as many as approximately, eg., 
6,000 nt, depending upon the specific use. Probes are used in the detection of identical, similar, 
or complementary nucleic acid sequences. Longer length probes are generally obtained from a 
natural or recombinant source, are highly specific, and much slower to hybridize than shorter- 
length oligomer probes. Probes may be single- or double-stranded and designed to have 

30 specificity in PCR, membrane-based hybridization technologies, or ELISA-like technologies. 

The term "isolated" nucleic acid molecule, as utilized herein, is one, which is separated 
from other nucleic acid molecules which are present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic acid 
(i.e., sequences located at the 5 - and 3-termini of the nucleic acid) in the genomic DNA of the 
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organism from which the nucleic acid is derived. For example, in various embodiments, the 
isolated MOLX nucleic acid molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 
0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
genomic DNA of the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver, 
5 spleen, etc.). Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material or culture medium when produced by recombinant 
techniques, or of chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a 

1 0 complement of this aforementioned nucleotide sequence, can be isolated using standard 

molecular biology techniques and the sequence information provided herein. Using all or a 
portion of the nucleic acid sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19,21,23,25, 
27, and 29 as a hybridization probe, MOLX molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as described in Sambrook, et aL, (eds.), MOLECULAR 

15 Cloning: A Laboratory Manual 2 nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989; and Ausubel, et aL, (eds.), Current Protocols IN Molecular Biology, 
John Wiley & Sons, New York, NY, 1993 .) 

A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR 

20 amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector 
and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to 
MOLX nucleotide sequences can be prepared by standard synthetic techniques, e.g., using an 
automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide residues, 

25 which oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction. 
A short oligonucleotide sequence may be based on, or designed from, a genomic or cDNA 
sequence and is used to amplify, confirm, or reveal the presence of an identical, similar or 
complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise portions 
of a nucleic acid sequence having about 10 nt, 50 nt, or 1 00 nt in length, preferably about 15 nt 

30 to 30 nt in length. In one embodiment of the invention, an oligonucleotide comprising a nucleic 
acid molecule less than 100 nt in length would further comprise at least 6 contiguous nucleotides 
of SEQ IDNOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a complement thereof. 
Oligonucleotides may be chemically synthesized and may also be used as probes. 
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In another embodiment, an isolated nucleic acid molecule of the invention comprises a 
nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID NOS: 1 , 
3,5,7, 9, 11, 13, 15, 17, 19,21,23,25, 27, and 29, or a portion of this nucleotide sequence (e.g., 
a fragment that can be used as a probe or primer or a fragment encoding a biologically-active 
5 portion of an MOLX polypeptide). A nucleic acid molecule that is complementary to the 

nucleotide sequence shown in SEQ ED NOS: 1 , 3, 5, 7, 9, 11, 13, 15, 17, 19,21,23,25, 27, and 
29 is one that is sufficiently complementary to the nucleotide sequence shown in SEQ ID 
NOS: 1 , 3, 5, 7, 9, 1 1 , 1 3, 1 5, 1 7, 19, 21, 23, 25, 27, and 29 that it can hydrogen bond with little 
or no mismatches to the nucleotide sequence shown SEQ ID NOS:l , 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 

10 21 , 23, 25, 27, and 29, thereby forming a stable duplex. 

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen base 
pairing between nucleotides units of a nucleic acid molecule, and the term "binding" means the 
physical or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van der 

1 5 Waals, hydrophobic interactions, and the like. A physical interaction can be either direct or 
indirect. Indirect interactions may be through or due to the effects of another polypeptide or 
compound. Direct binding refers to interactions that do not take place through, or due to, the 
effect of another polypeptide or compound, but instead are without other substantial chemical 
intermediates. 

20 Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic 

acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific hybridization 
in the case of nucleic acids or for specific recognition of an epitope in the case of amino acids, 
respectively, and are at most some portion less than a full length sequence. Fragments may be 
derived from any contiguous portion of a nucleic acid or amino acid sequence of choice. 

25 Derivatives are nucleic acid sequences or amino acid sequences formed from the native 

compounds either directly or by modification or partial substitution. Analogs are nucleic acid 
sequences or amino acid sequences that have a structure similar to, but not identical to, the native 
compound but differs from it in respect to certain components or side chains. Analogs may be 
synthetic or from a different evolutionary origin and may have a similar or opposite metabolic 

30 activity compared to wild type. Homologs are nucleic acid sequences or amino acid sequences 
of a particular gene that are derived from different species. 

Derivatives and analogs may be full length or other than full length, if the derivative or 
analog contains a modified nucleic acid or amino acid, as described below. Derivatives or 
analogs of the nucleic acids or proteins of the invention include, but are not limited to, molecules 
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comprising regions that are substantially homologous to the nucleic acids or proteins of the 
invention, in various embodiments, by at least about 70%, 80%, or 95% identity (with a 
preferred identity of 80-95%) over a nucleic acid or amino acid sequence of identical size or 
when compared to an aligned sequence in which the alignment is done by a computer homology 
5 program known in the art, or whose encoding nucleic acid is capable of hybridizing to the 
complement of a sequence encoding the aforementioned proteins under stringent, moderately 
stringent, or low stringent conditions. See eg. Ausubel, et a!., Current Protocols in 
MOLECULAR BIOLOGY, John Wiley & Sons, New York, NY, 1 993, and below. 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 

1 0 variations thereof, refer to sequences characterized by a homology at the nucleotide level or 

amino acid level as discussed above. Homologous nucleotide sequences encode those sequences 
coding for isoforms of MOLX polypeptides. Isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA. Alternatively, 
isoforms can be encoded by different genes. In the invention, homologous nucleotide sequences 

1 5 include nucleotide sequences encoding for an MOLX polypeptide of species other than humans, 
including, but not limited to: vertebrates, and thus can include, e.g., frog, mouse, rat, rabbit, dog, 
cat cow, horse, and other organisms. Homologous nucleotide sequences also include, but are not 
limited to, naturally occurring allelic variations and mutations of the nucleotide sequences set 
forth herein. A homologous nucleotide sequence does not, however, include the exact nucleotide 

20 sequence encoding human MOLX protein. Homologous nucleic acid sequences include those 
nucleic acid sequences that encode conservative amino acid substitutions (see below) in SEQ ID 
NOS: 1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, as well as a polypeptide possessing 
MOLX biological activity. Various biological activities of the MOLX proteins are described 
below. ■ 

25 An MOLX polypeptide is encoded by the open reading frame ("ORF") of an MOLX 

nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be translated 
into a polypeptide. A stretch of nucleic acids comprising an ORF is uninterrupted by a stop 
codon. An ORF that represents the coding sequence for a full protein begins with an ATG 
"start" codon and terminates with one of the three "stop" codons, namely, TAA, TAG, or TGA. 

30 For the purposes of this invention, an ORF may be any part of a coding sequence, with or 

without a start codon, a stop codon, or both. For an ORF to be considered as a good candidate 
for coding for a bona fide cellular protein, a minimum size requirement is often set, e.g., a stretch 
of DNA that would encode a protein of 50 amino acids or more.. 
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The nucleotide sequences determined from the cloning of the human MOLX genes 
allows for the generation of probes and primers designed for use in identifying and/or cloning 
MOLX homologues in other cell types, e.g. from other tissues, as well as MOLX homologues 
from other vertebrates. The probe/primer typically comprises substantially purified 
5 oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence thai 
hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350 or 
400 consecutive sense strand nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 2 1 , 23, 25, 27, and 29; or an anti-sense strand nucleotide sequence of SEQ ID NOS: 1,3,5, 7, 
9, 11, 13, 15, 17, 19,21,23, 25,27, and 29; or of a naturally occurring mutant of SEQ ID 

10 NOS:l,3,5,7,9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29. 

Probes based on the human MOLX nucleotide sequences can be used to detect transcripts 
or genomic sequences encoding the same or homologous proteins. In various embodiments, the 
probe further comprises a label group attached thereto, e.g. the label group can be a radioisotope, 
a fluorescent compound, an enzyme, or an enzyme co- factor. Such probes can be used as a part 

15 of a diagnostic test kit for identifying cells or tissues which mis-express an MOLX protein, such 
as by measuring a level of an MOLX-encoding nucleic acid in a sample of cells from a subject 
e.g., detecting MOLX mRNA levels or determining whether a genomic MOLX gene has been 
mutated or deleted. 

. "A polypeptide having a biologically-active portion of an MOLX polypeptide" refers to 
20 polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 

polypeptide of the invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. A nucleic acid fragment encoding a "biologically-? 
active portion of MOLX" can be prepared by isolating a portion SEQ ID NOS:l, 3, 5, 7, 9, 11, 
13, 15, 17, 19, 21, 23, 25, 27, and 29 that encodes a polypeptide having an MOLX biological 
25 activity (the biological activities of the MOLX proteins are described below), expressing the 
encoded portion of MOLX protein {e.g., by recombinant expression in vitro) and assessing the 
activity of the encoded portion of MOLX. 

MOLX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the nucleotide 
30 sequences shown SEQ ID NOS:i, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29 due to 

degeneracy of the genetic code and thus encode the same MOLX proteins as that encoded by the 
nucleotide sequences shown in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 
29. In another embodiment, an isolated nucleic acid molecule of the invention has a nucleotide 
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sequence encoding a protein having an amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 
10, 12, 14, 16,18, 20, 22,24,26,28, and 30. 

In addition to the human MOLX nucleotide sequences shown in SEQ ID NOS:l 9 3, 5, 7, 
9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29 it will be appreciated by those skilled in the art that 
5 DNA sequence polymorphisms that lead to changes in the amino acid sequences of the MOLX 
polypeptides may exist within a population (e.g. , the human population). Such genetic 
polymorphism in the MOLX genes may exist among individuals within a population due to 
natural allelic variation. As used herein, the terms "gene" and "recombinant gene" refer to 
nucleic acid molecules comprising an open reading frame (ORF) encoding an MOLX protein, 

10 preferably a vertebrate MOLX protein. Such natural allelic variations can typically result in 
1-5% variance in the nucleotide sequence of the MOLX genes. Any and all such nucleotide 
variations and resulting amino acid polymorphisms in the MOLX polypeptides, which arc the 
result of natural allelic variation and that do not alter the functional activity of the MOLX 
polypeptides, are intended to be within the scope of the invention. 

1 5 Moreover, nucleic acid molecules encoding MOLX proteins from other species, and thus 

that have a nucleotide sequence that differs from the human sequence SEQ ID NOS:l, 3, 5, 7, 9, 
11, 13, 15, 17, 19. 21, 23, 25, 27, and 29 are intended to be within the scope of the invention. 
Nucleic acid molecules corresponding to natural allelic variants and homologues of the MOLX 
cDN As of the invention can be isolated based on their homology to the human MOLX nucleic 

20 acids disclosed herein using the human cDNAs, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic.acid molecule of the invention 
is at least 6 nucleotides in length and hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence of SEQ ID NOSrl, 3, 5, 7, 9, 11, 13, 15, 17, 19, 

25 21, 23, 25, 27, and 29. In another embodiment, the nucleic acid is at least 10, 25, 50, 100, 250, 
500, 750, 1000, 1500, or 2000 or more nucleotides in length. In yet another embodiment, an 
isolated nucleic acid molecule of the invention hybridizes to the coding region. As used herein, 
the term "hybridizes under stringent conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide sequences at least 60% homologous to each 

30 other typically remain hybridized to each other. 

Homologs (i.e., nucleic acids encoding MOLX proteins derived from species other than 
human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or high 
stringency hybridization with all or a portion of the particular human sequence as a probe using 
methods well known in the art for nucleic acid hybridization and cloning. 

90 



BNSOOCIC* <WO 0181578A2J_> 



WO01/XI578 



PCT/US01/1357X 



As used herein, the phrase "stringent hybridization conditions" refers to conditions under 
which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no other 
sequences. Stringent conditions are sequence-dependent and will be different in different 
circumstances. Longer sequences hybridize specifically at higher temperatures than shorter 
5 sequences. Generally, stringent conditions are selected to be about 5 °C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the 
temperature (under defined ionic strength, pH and nucleic acid concentration) at which 50% of 
the probes complementary to the target sequence hybridize to the target sequence at equilibrium. 
Since the target sequences are generally present at excess, at Tm, 50% of the probes are occupied 

10 at equilibrium. Typically, stringent conditions will be those in which the salt concentration is 
less than about 1 .0 M sodium ion, typically about 0.01 to 1 .0 M sodium ion (or other salts) at 
pH 7.0 to 83 and the temperature is at least about 30°C for short probes, primers or 
oligonucleotides (e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers and 
oligonucleotides. Stringent conditions may also be achieved with the addition of destabilizing 

1 5 agents, such as formamide. 

Stringent conditions are known to those skilled in the art and can be found in Ausubel, et 
aL, (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.1-6,3.6. Preferably, the conditions are such that sequences at least about 65%, 70%, 75%, 
85%, 90%, 95%, 98%, or 99% homologous to each other typically remain hybridized to each 

20 other. A non-limiting example of stringent hybridization conditions are hybridization in a high 
salt buffer comprising 6X SSC, 50 mM Tris-HCJ (pH 7.5), 1 mM EDTA, 0.02% PVP, 0:02% 
Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA at 65°C, followed by one or 
more washes in 0.2X SSC, 0.01 % BSA at 50°C. An isolated nucleic apid molecule of the 
invention that hybridizes under stringent conditions to the sequences of SEQ ID NOS: 1,3,5,7, 

25 9, 11, 13, 15, 17, 19, 21,23,25, 27, and 29 corresponds to a hatumlly^ccumng nucleic acid 
molecule. As used herein, a "naturallyoccurring ,! nucleic acid molecule refers to an RNA or 
DNA molecule having a nucleotide sequence that occurs in nature (eg., encodes a natural 
protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic acid 
30 molecule comprising the nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 
21, 23, 25, 27, and 29 or fragments, analogs or derivatives thereof, under conditions of moderate 
stringency is provided. A non-limiting example of moderate stringency hybridization conditions 
are hybridization in 6X SSC, 5X Denhardfs solution, 0.5% SDS and 100 mg/ml denatured 
salmon sperm DNA at 55°C, followed by one or more washes in IX SSC, 0.1% SDS at 37°C. 
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expressed by any recombinant technology known in the art and the activity of the protein can be 
determined. 

The relatedness of amino acid families may also be determined based on side chain 
interactions. Substituted amino acids may be fully conserved "strong" residues or fully 
5 conserved "weak" residues. The "strong" group of conserved amino acid residues may be any 
one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, M1LF, HY, FYW, 
wherein the single letter amino acid codes are grouped by those amino acids that may be 
substituted for each other. Likewise, the "weak" group of conserved residues may be any one of 
the following: CSA, ATV, SAG, SINK, STTA, SGND, SNDEQK, NDEQHK, NEQHRK, 
10 VLIM, HFY, wherein the letters within each group represent the single letter amino acid code. 

In one embodiment, a mutant MOLX protein can be assayed for (i) the ability to form 
protein:protein interactions with other MOLX proteins, other cell-surface proteins, or 
biologically-active portions thereof, (ii) complex formation between a mutant MOLX protein 
and an MOLX ligand; or (Hi) the ability of a mutant MOLX protein to bind to an intracellular 
1 5 target protein or biologically-active portion thereof; (e.g. avidin proteins). 

In yet another embodiment, a mutant MOLX protein can be assayed for the ability to 
regulate a specific biological function (e.g., regulation of insulin release). 

Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
20 are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NQS:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or fragments, 
analogs or derivatives thereof. An "antisense" nucleic acid comprises a nucleotide sequence that 
is complementary to a "sense" nucleic acid encoding a protein complementary to the 
coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence), 
25 In specific aspects, antisense nucleic acid molecules are provided that comprise a sequence 
complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire MOLX 
coding strand, or to only a portion thereof. Nucleic acid molecules encoding fragments, 
homologs, derivatives and analogs of an MOLX protein of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, and 30, or antisense nucleic acids complementary to an MOLX nucleic 
30 acid sequence of SEQ ID NOS: 1, 3, 5, 7, 9, 1 1, 13, 1 5, 17, 19, 21, 23, 25, 27, and 29, are 
additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
of the coding strand of a nucleotide sequence encoding an MOLX protein. The term "coding 
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region" refers to the region of the nucleotide sequence comprising codons which are translated 
into amino acid residues. In another embodiment, the antisensc nucleic acid molecule is 
antisense to a "noncoding region" of the coding strand of a nucleotide sequence encoding the 
MOLX protein. The term "noncoding region" refers to 5' and 3' sequences which flank the 
5 coding region that are not translated into amino acids (i.e., also referred to as 5* and 3' 
untranslated regions). 

Given the coding strand sequences encoding the MOLX protein disclosed herein, 
antisense nucleic acids of the invention can be designed according to the rules of Watson and 
Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary to 

1 0 the entire coding region of MOLX mRNA, but more preferably is an oligonucleotide that is 

antisense to only a portion of the coding or noncoding region of MOLX mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of MOLX mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 1 5, 
20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can 

15 be constructed using chemical synthesis or enzymatic ligation reactions using procedures known 
in the art. For example, an antisense nucleic acid (eg., an antisense oligonucleotide) can be 
chemically synthesized using naturally-occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids (e.g. , phosphorothioate 

20 derivatives and acridine substituted nucleotides can be used). 

Examples of modified nucleotides that can be used to generate the antisense nucleic acid 
include: 5-fluorburacil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, . 
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylarninomethyl* 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 

25 inosine, N6-isopentenyladenine, 1 -methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 

2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
30 queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 

uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 

3- (3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (Le. f RNA transcribed from the 
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inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mKNA and/or 
5 genomic DN A encoding an MOLX protein to thereby inhibit expression of the protein (e.g. , by 
inhibiting transcription and/or translation). The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid 
molecule that binds to DNA duplexes, through specific interactions in the major groove of the 
double helix. An example of a route of administration of antisense nucleic acid molecules of the 

10 invention includes direct injection at a tissue site. Alternatively, antisense nucleic acid 

molecules can be modified to target selected cells and then administered systemically. For 
example, for systemic administration, antisense molecules can be modified such that they 
specifically bind to receptors or antigens expressed on a selected cell surface (e.g., by linking the 
antisense nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or 

1 5 antigens). The antisense nucleic acid molecules can also be delivered to cells using the vectors 
described herein. To achieve sufficient nucleic acid molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the control of a strong pol II or pol IE promoter 
are. preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
20 a-anomeric nucleic acid molecule. An ct-anomeric nucleic acid molecule forms specific 

double-stranded hybrids with complementary RNA in which; contrary to the usual P-units, the 
strands run parallel to each other. See, e.g., Gaultier, et a!., 1987. Nuci- Acids Res. 15: 
6625-664 1 . The antisense nucleic acid molecule can also comprise a 2 , -o-methylribonucleotide 
(see, eg., Inoue, et al, 1987. Nucl. Acids Res. 15: 6131-6148) or a chimeric RNA-DNA analogue 
25 (see, e.g., Inoue, et al., 1987. FEBSLett 215: 327-330. 

Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified bases, and 
nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
30 modifications are carried out at least in part to enhance the chemical stability of the modified 
nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in 
therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. Ribozymes 
are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a 
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single-stranded nucleic acid, such as an mRNA, to which they have a complementary region. 
Thus, ribozymes (eg., hammerhead ribozymes as described in Hasclhoff and Gerlach 1988. 
Nature 334: 585-591) can be used to catalytically cleave MOLX rnRNA transcripts to thereby 
inhibit translation of MOLX mRNA. A ribozyme having specificity for an MOLX-encoding 
5 nucleic acid can be designed based upon the nucleotide sequence of an MOLX cDNA disclosed 
herein (i.e., SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29). For example, a 
derivative of a Tetrahymena L-19 IYS RNA can be constructed in which the nucleotide sequence 
of the active site is complementary to the nucleotide sequence to be cleaved in an 
MOLX-encoding mRNA. See, eg., U.S. Patent 4,987,071 to Cech, et al and U.S. Patent 
10 5,1 1 6,742 to Cech, et al MOLX mRNA can also be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, eg., Bartel etal, (1993) 
Science 261:1411-1418. 

Alternatively, MOLX gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region of the MOLX nucleic acid (eg., the MOLX promoter 
1 5 and/or enhancers) to form triple helical structures that prevent transcription of the MOLX gene in. 
target cells. See, e.g., Helene, 1991. Anticancer Drug Des. 6: 569-84; Helene, et al 1992. Ann. 
N.Y. Acad Sci. 660: 27-36; Maher, 1992. Bioassays 14: 807-15. 

In various embodiments, the MOLX nucleic acids can be modified at the base moiety, 
sugar moiety or phosphate backbone to improve, eg., the stability, hybridization, or solubility of 
20 the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can be 
modified to generate peptide nucleic acids. See f e.g., Hyrup, etal, 1996. BioorgMed Chem 4: 
5-23. As used herein, the terms "peptide nucleic acids' 1 or "PNAs" refer to nucleic acid mimics 
(eg. , DNA mimics) in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
25 backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup, et al, 1996. supra\ 
Perry-O'Keefe, et al, 1996. Proc. Natl Acad. ScL USA 93: 14670-14675. 

PNAs of MOLX can be used in therapeutic and diagnostic applications. For example, 
30 PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 

expression by, e.g., inducing transcription or translation arrest or inhibiting replication. PNAs of 
MOLX can also be used, for example, in the analysis of single base pair mutations in a gene 
(e.g. , PNA directed PGR clamping; as artificial restriction enzymes when used in combination 
with other enzymes, eg., S\ nucleases (see, Hyrup, et al. 1996-si/pra); or as probes or primers 

.97 



BNSDCCIO: <WO_0181578A2J_> 



WO 01/81578 



PCT/USO 1/1 3578 



for DMA sequence and hybridization (see, Hyrup, et a/., 1996, supra; Perry-O'Keefe, et ai, 
1996. supra). 

In another embodiment, PNAs of MOLX can be modified, e.g., to enhance their stability 
or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of 
5 PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery known in 
the art. For example, PNA-DNA chimeras of MOLX can be generated that may combine the 
advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes 
(e.g., RNase H and DNA polymerases) to interact with the DNA portion while the PNA portion 
would provide high binding affinity and specificity. PNA-DNA chimeras can be linked using 

10 linkers of appropriate lengths selected in terms of base stacking, number of bonds between the 
niicleobases, and orientation (see, Hyrup, et al., 1996. supra). The synthesis of PNA-DNA 
chimeras can be performed as described in Hyrup, et ai, 1996. supra and Finn, et al, 1996. Nucl 
Acids Res 24: 3357-3363. For example, a DNA chain can be synthesized on a solid support 
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g. 9 

15 5X4-methoxytrityl)arnino-5 f -deoxy-thymidine phosphoramidite, can be used between the PNA 
and the 5' end of DNA. See, e.g., Mag, et aL, 1989. Nucl Acid Res >11: 5973-5988. PNA 
monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5* PNA 
segment and a 3' DNA segment. See, e.g., Finn, et al. y 1996. supra. Alternatively, chimeric 
molecules can be synthesized with a 5' DNA segment and a 3 ! PNA segment. See, e.g., Petersen, 

20 et aL, 1 975. Bioorg. Med. Client. Lett. 5: 1119-11 124. 

In other embodiments, the oligonucleotide may include other appended groups such as 
peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the . 
cell membrane (see, e.g: f Letsinger, et al, ,1989. Proc. Natl Acad. Set U.SA. 86: 6553-6556; 
Lernaitre, et al y 1987. Proc. Natl Acad. Sci. 84: 648-652; PCT Publication No. WO88/09810) or 

25 the blood-brain barrier (.see, e.g. , PCT Publication No. WO 89/10 1 34). In addition, 

oligonucleotides can be modified with hybridization triggered cleavage agents (see, e.g. t Krol, 'er 
a/., 1988. BioTechniques 6:958-976) or intercalating agents (see, e.g. t Zon, 1988. Pharm. Res. 5: 
539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
peptide, a hybridization triggered cross-linking agent } a transport agent, a hybridization-triggered 

30 cleavage agent, and the like. 

MOLX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the amino acid 
sequence of MOLX polypeptides whose sequences are provided in SEQ ID NOS:2, 4, 6, 8, 10, . 
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12, 14, 16, 18, 20 ; 22, 24, 26, 28, and 30. The invention also includes a mutant or variant protein 
any of whose residues may be changed from the corresponding residues shown in SEQ ID 
NOS:2 f 4, 6, 8, 10, 12, 14, 1 6, 1 8, 20, 22, 24, 26, 28, and 30 while still encoding a protein that 
maintains its MOLX activities and physiological functions, or a functional fragment thereof. 
5 In general, an MOLX variant that preserves MOLX-like function includes any variant in 

which residues at a particular position in the sequence have been substituted by other amino 
acids, and further include the possibility of inserting an additional residue or residues between 
two residues of the parent protein as well as the possibility of deleting one or more residues from 
the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed by the 
10 invention. In favorable circumstances, the substitution is a conservative substitution as defined 
above. 

One aspect of the invention pertains to isolated MOLX proteins, and biologically-active 
portions thereof, or derivatives, fragments, analogs or homologs thereof. Also provided are 
polypeptide fragments suitable for use as immunogens to raise anti-MOLX antibodies. In one 

1 5 embodiment, native MOLX proteins can be isolated from cells or tissue sources by an 

appropriate purification scheme using standard protein purification techniques. In another 
embodiment, MOLX proteins are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an MOLX protein or polypeptide can be synthesized chemically using 
standard peptide synthesis techniques. 

20 An "isolated" or "purified" polypeptide or protein or biologically-active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or tissue 
source from which the MOLX protein is derived, or substantially free from chemical precursors 
or other chemicals when chemically synthesized. The language "substantially free of cellular 
material" includes preparations of MOLX proteins in which the protein is separated from cellular 

25 components dfthe cells from which it is isolated or recombinantly-produced. In one 

embodiment, the language "substantially free of cellular material" includes preparations of 
MOLX proteins having less than about 30% (by dry weight) of non-MOLX proteins (also 
referred to herein as a "contaminating protein"), more preferably less than about 20% of 
non-MOLX proteins, still more preferably less than about 10% of non-MOLX proteins, and most 

30 preferably less than about 5% of non-MOLX proteins. When the MOLX protein or biologically- 
active portion thereof is recombinantly-produced, it is also preferably substantially free of 
culture medium, Le. t culture medium represents less than about 20%, more preferably less than 
about 1 0%, and most preferably less than about 5% of the volume of the MOLX protein 
preparation. 
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The language "substantially free of chemical precursors or other chemicals" includes 
preparations of MOLX proteins in which the protein is separated from chemical precursors or 
other chemicals that are involved in the synthesis of the protein. In one embodiment, the 
language "substantially free of chemical precursors or other chemicals" includes preparations of 
5 MOLX proteins having less than about 30% (by dry weight) of chemical precursors or 

non-MOLX chemicals, more preferably less than about 20% chemical precursors or non-MOLX 
chemicals, still more preferably less than about 10% chemical precursors or non-MOLX 
chemicals, and most preferably less than about 5% chemical precursors or non-MOLX 
chemicals. 

1 0 Biologically-active portions of MOLX proteins include peptides comprising amino acid 

sequences sufficiently homologous to or derived from the amino acid sequences of the MOLX 
proteins (e.g., the amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, and 30) that include fewer amino acids than the full-length MOLX proteins, and 
exhibit at least one activity of an MOLX protein. Typically, biologically-active portions 

1 5 comprise a domain or motif with at least one activity of the MOLX protein. A biologically- 
active portion of an MOLX protein can be a polypeptide which is, for example, 10, 25, 50, 100 
or more amino acid residues in length. 

Moreover, other biologically-active portions, in which other regions of the protein are 
deleted, can be prepared by recombinant techniques and evaluated for one or more of the 

20 functional activities of a native MOLX protein. 

In an embodiment, the MOLX protein has an amino acid sequence shown in SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 2& 28, and 30. in other embodiments, the MOLX 
protein is substantially homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, .18, 20, 22, 24, 26, 
28, and 30, and retains the functional activity of the protein of SEQ ID NOS:2, 4, 6, 8, 10, 12, 

25 14, 16, 18, 20, 22, 24, 26, 28, and 30, yet differs in amino acid sequence due to natural allelic 

variation or mutagenesis, as described in detail, below. Accordingly, in another embodiment, the 
MOLX protein is a protein that comprises an amino acid sequence at least about 45% 
homologous to the amino acid sequence SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, and 30, and retains the functional activity of the MOLX proteins of SEQ ID NOS:2, 4, 6, 

30 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. 



Determining Homology Between Two or More Sequences 

To determine the. percent homology of two amino acid sequences or of two nucleic acids, 
_ the sequences are aligned for optimal comparison purposes (eg., gaps can be introduced in the 
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sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence). The amino acid residues or nucleotides at corresponding amino 
acid positions or nucleotide positions are then compared. When a position in the first sequence 
is occupied by the same amino acid residue or nucleotide as the corresponding position in the 
5 second sequence, then the molecules are homologous at that position (i.e. , as used herein amino 
acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid "identity"). 

The nucleic acid sequence homology may be determined as the degree of identity 
between two sequences. The homology may be determined using computer programs known in 
the art, such as GAP software provided in the GCG program package. See, Needleman and 

10 Wunsch, 1970. J Mo! Biol 48: 443-453. Using GCG GAP software with the following settings 
for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP extension penalty 
of 0.3, the coding region of the analogous nucleic acid sequences referred to above exhibits a 
degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with the 
CDS (encoding) part of the DNA sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 

15 19, 21, 23, 25, 27, and 29. 

The term "sequence identity" refers to the degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular region of 
comparison. The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of positions 

20 at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of nucleic acids) 
occurs in both sequences to yield the number of matched positions, dividing the number of 
matched positions by the total number of positions in the region of comparison (i. e. , the window 
size), and multiplying the result by 100 to yield the percentage of sequence identity. The term 
"substantial identity" as used herein denotes a characteristic of a polynucleotide sequence, 

25 wherein the polynucleotide comprises a sequence that has at least 80 percent sequence identity, 
preferably at least 85 percent identity and often 90 to 95 percent sequence identity, more usually 
at least 99 percent sequence identity as compared to a reference sequence over a comparison 
region. 

30 Chimeric and Fusion Proteins 

The invention also provides MOLX chimeric or fusion proteins. As used herein, an 
MOLX "chimeric protein" or "fusion protein" comprises an MOLX polypeptide operatively- 
linked to a non-MOLX polypeptide. An "MOLX polypeptide" refers to a polypeptide having an 
. amino acid sequence corresponding to an MOLX protein (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
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18, 20, 22, 24, 26, 28, and 30), whereas a "non-MOLX polypeptide" refers to a polypeptide 
having an amino acid sequence corresponding to a protein that is not substantially homologous 
to the MOLX protein, e.g., a protein that is different from the MOLX protein and that is derived 
from the same or a different organism. Within an MOLX fusion protein the MOLX polypeptide 
5 can correspond to all or a portion of an MOLX protein. In one embodiment, an MOLX fusion 
protein comprises at least one biologically-active portion of an MOLX protein. In another 
embodiment, an MOLX fusion protein comprises at least two biologically-active portions of an 
MOLX protein. In yet another embodiment, an MOLX fusion protein comprises at least three 
biologically-active portions of an MOLX protein. Within the fusion protein, the term 

1 0 "operati vely-linked" is intended to indicate that the MOLX polypeptide and the non-MOLX 
polypeptide are fused in-frame with one another. The non-MOLX polypeptide can be fused to 
the N-terminus or C-terminus of the MOLX polypeptide. 

In one embodiment, the fusion protein is a GST-MOLX fusion protein in which the 
MOLX sequences are fused to the C-terminus of the GST (glutathione S-transferase) sequences. 

1 5 Such fusion proteins can facilitate the purification of recombinant MOLX polypeptides. 

In another embodiment, the fusion protein is an MOLX protein containing a heterologous 
signal sequence at its N-terminus. In certain host cells (e.g., mammalian host cells), expression 
and/or secretion of MOLX can be increased through use of a heterologous signal sequence. 
In yet.another embodiment, the fusion protein is an MOLX-immunoglobulin fusion 

20 protein in which the MOLX sequences are fused to sequences derived from a member of the 
immunoglobulin protein family. The MOLX-immunoglobulin fusion proteins of the invention 
can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an 
interaction between an MOLX ligand and an MOLX protein on the surface of a cell, to thereby 
suppress MOLX-mediated signal transduction in vivo. The MOLX-immunoglobulin fusion 

25 proteins can be used to affect the bioavailability of an MOLX cogjiate ligand. Inhibition of the 
MOLX ligand/MOLX interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, as well as modulating (e.g. promoting or inhibiting) 
cell survival. Moreover, the MOLX-immunoglobulin fusion proteins of the invention can be 
used as immunogens to produce anti-MOLX antibodies in a subject, to purify MOLX ligands, 

30 and in screening assays to identify molecules that inhibit the interaction of MOLX with an 
MOLX ligand. 

An MOLX chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional techniques, 
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e.g., by employing blunt-ended ot stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline 
phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by conventional tecliniques including 
5 automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers that give rise to complementary overhangs between two 
consecutive gene fragments that can subsequently be annealed and reamplified to generate a 
chimeric gene sequence (see, e.g., Ausubel, al (eds.) CURRENT PROTOCOLS IN MOLECULAR 
BIOLOGY, John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
10 available that already encode a fusion moiety (e.g., a GST polypeptide). An MOLX-encoding 
nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the MOLX protein. 

MOLX Agonists and Antagonists 

The invention also pertains to variants of the MOLX proteins that function as either 
MOLX agonists (i.e., mimetics) or as MOLX antagonists. Variants of the MOLX protein can be 
generated by mutagenesis (e.g., discrete point mutation or truncation of the MOLX protein). An 
agonist.of the MOLX protein can retain substantially the same, or a subset of, the biological 
activities of the naturally occurring form of the MOLX protein. An antagonist of the MOLX 
protein cart inhibit one or more of the activities of the naturally occurring form of the MOLX 
protein by, for example, competitively binding to a downstream or upstream member of a 
cellular signaling cascade which includes the MOLX protein. Thus, specific biological effects 
can be elicited by treatment with a variant of limited function. In one embodiment, treatment of 
a subject with a variant having a subset of the biological activities of the naturally occurring 
form of the protein has fewer side effects in a subject relative to treatment with the naturally 
occurring form of the MOLX proteins. 

Variants of the MOLX proteins that function as either MOLX agonists (i.e. f mimetics) or 
as MOLX antagonists can be identified by screening combinatorial libraries of mutants (e.g. f 
truncation mutants) of the MOLX proteins for MOLX protein agonist or antagonist activity. In 
one embodiment, a variegated library of MOLX variants is generated by combinatorial 
mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A variegated 
library of MOLX variants can be produced by, for example, enzymatically ligating a mixture of 
synthetic oligonucleotides into gene sequences such that a degenerate set of potential MOLX 
sequences is expressible as individual polypeptides, or alternatively, as a set of larger fusion 
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proteins (e.g., for phage display) containing the set of MOLX sequences therein. There are a 
variety of methods which can be used to produce libraries of potential MOLX variants from a 
degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 
performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 
5 appropriate expression vector. Use of a degenerate set of genes allows for the provision, in one 
mixture, of all of the sequences encoding the desired set of potential MOLX sequences. 
Methods for synthesizing degenerate oligonucleotides are well-known within the art. See, e.g., 
Narang, 1983. Tetrahedron 39: 3; Itakura, et al, 1984. Annu. Rev. Biochem. 53: 323; Itakura, et 
a/., 1984. Science 198: 1056; Ike, ei al, 1983. Nucl AcidsRes. 11:477. 

10 

Polypeptide Libraries 

In addition, libraries of fragments of the MOLX protein coding sequences can be used to 
generate a variegated population of MOLX fragments for screening and subsequent selection of 
variants of an MOLX protein. In one embodiment, a library of coding sequence fragments can 

15 be generated by treating a double stranded PCR fragment of an MOLX coding sequence with a 
nuclease under conditions wherein nicking occurs only about once per molecule, denaturing the 
double stranded DNA, renaturing the DNA to form double-stranded DNA that can include 
sense/antisense pairs from different nicked products, removing single stranded portions from 
reformed duplexes by treatment with Si nuclease, and ligating the resulting fragment library into 

20 an expression vector. By this method,, expression libraries can be derived which encodes 
N-terminal and internal fragments of various sizes of the MOLX proteins. 

Various techniques are known in the art for screening gene products of combinatorial 
libraries made by point mutations or truncation, and for screening cDNA libraries for gene 
products having a selected property. Such techniques are adaptable for rapid screening of the 

25 gene libraries generated by the combinatorial mutagenesis of MOLX proteins. The most widely 
used techniques, which are amenable to high throughput analysis, for screening large gene 
libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection of a desired activity facilitates isolation 

30 of the vector encoding the gene whose product was detected. Recursive ensemble mutagenesis 
(REM), a new technique that enhances the frequency of functional mutants in the libraries, can 
be used in combination with the screening assays to identify MOLX variants. See, e.g„ Arkin 
and Yourvan, 1992. Proc. Natl Acad. ScL USA 89: 781 1-7815; Delgrave, et aL, 1993. Protein 
Engineering 6:327 '-331. 

104 



BNSDOCID: <WO 01B1S78A2J_> 



WO 01/81578 



PCT/USO 1/13578 



Anti-MOLX Antibodies 

The invention encompasses antibodies and antibody fragments, such as F ab or (F ab )2, that 
bind immunospecifically to any of the MOLX polypeptides of said invention. 

An isolated MOLX protein, or a portion or fragment thereof, can be used as an 
5 immunogen to generate antibodies that bind to MOLX polypeptides using standard techniques 
for polyclonal and monoclonal antibody preparation. The full-length MOLX proteins can be 
used or, alternatively, the invention provides antigenic peptide fragments of MOLX proteins for 
use as immunogens. The antigenic MOLX peptides comprises at least 4 amino acid residues of 
the amino acid sequence shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 1 8, 20, 22, 24, 26, 28, 

1 0 and 30 and encompasses an epitope of MOLX such that an antibody raised against the peptide 
forms a specific immune complex with MOLX. Preferably, the antigenic peptide comprises at 
least 6, 8, 10, 1 5, 20, or 30 amino acid residues. Longer antigenic peptides are sometimes 
preferable over shorter antigenic peptides, depending on use and according to methods well 
known to someone skilled in the art. 

15 In certain embodiments of the invention, at least one epitope encompassed by the 

antigenic peptide is a region of MOLX that is located on the surface of the protein (e.g., a 
hydrophilic region). As a means for targeting antibody production, hydropathy plots showing 
regions of hydrophilicity and hydrophobicity may be generated by any method well known in the 
art, including, for example, the Kyte Doolittle or the Hopp Woods methods, either with or 

20 without Fourier transformation (see, e.g., Hopp and Woods, 1981. Proc. Nat Acad. Sci. USA 78: 
3824-3828; Kyte and Doolittle, 1982. J. MoL Biol. 157: 105-142, each incorporated herein by 
reference in their entirety). 

As disclosed herein, MOLX protein sequences of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
1 8, 20, 22, 24, 26, 28, and 30, or derivatives, fragments, analogs or homologs thereof, may be 

25 utilized as immunogens in the generation of antibodies that immunospecifically-bind these 

protein components. The term "antibody" as used herein refers to immunoglobulin molecules 
and immunologically-active portions of immunoglobulin molecules, i.e., molecules that contain 
an antigen binding site that specifically-binds (immunoreacts with) an antigen, such as MOLX. 
Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, 

30 F ab and F mi fragments, and an expression library. In a specific embodiment, antibodies to 
human MOLX proteins are disclosed. Various procedures known within the art may be used for 
the production of polyclonal or monoclonal antibodies to an MOLX protein sequence of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, or a derivative, fragment, analog or 
homolog thereof. Some of these proteins are discussed below. 
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For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 
goat, mouse or other mammal) may be immunized by injection with the native protein, or a 
synthetic variant thereof, or a derivative of the foregoing. An appropriate immunogenic 
preparation can contain, for example, recombinantly-expressed MOLX protein or a chemically- 
5 synthesized MOLX polypeptide. The preparation can further include an adjuvant. Various 
adjuvants used to increase the immunological response include, but are not limited to, Freund's 
(complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances 
(e.g., lysolecithin, pluronic polyols, poly anions, peptides, oil emulsions, dinitrophenol, etc.), 
human adjuvants such as Bacille Cabnette-Guerin and Corynebacterium parvum, or similar 

10 inimunostimulatory agents. If desired, the antibody molecules directed against MOLX can be 
isolated from the mammal (e.g., from the blood) and further purified by well known techniques, 
such as protein A chromatography to obtain the IgG fraction. 

The term "monoclonal antibody" or "monoclonal antibody composition", as used herein, 
refers to a population of antibody molecules that contain only one species of an antigen binding 

1 5 site capable of immunoreacting with a particular epitope of MOLX. A monoclonal antibody 

composition thus typically displays a single binding affinity for a particular MOLX protein with 
which it immunoreacts. For preparation of monoclonal antibodies directed towards a particular 
MOLX protein, or derivatives, fragments, analogs or homologs thereof, any technique that 
provides for the production of antibody .molecules by continuous cell line culture may be 

20 utilized. Such techniques include, but arc not limited to, the hybridoma technique (see, e.g., 
Kohler & Milstein, 1975. Nature 256: 495-497); the trioma technique; the human B-cell 
hybridoma technique (see, e.g., Kozbor, et al, 1983. Immunol Today 4: 72) and the EBV 
hybridoma technique to produce human monoclonal antibodies (see, e.g: t Col6, et al, 1985. In: 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human 

25 monoclonal antibodies may be utilized in the practice of the invention and may be produced by 
using human hybridomas (see, e.g., Cote, et al, 1983. Proc Natl Acad Sci USA SO: 2026-2030) 
or by transforming human B-cells with Epstein Barr Virus in vitro (see, e.g., Cole, et al, 1985. 
In: MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan EL Liss, Inc., pp. 77-96). Each of 
the above citations is incorporated herein by reference in their entirety. 

30 According to the invention, techniques can be adapted for the production of single-chain 

antibodies specific to an MOLX protein (see, e.g., U.S. Patent No. 4,946,778). In addition, 
methods can be adapted for the construction of F a b expression hbraries (see, e.g., Huse, et al, 
1989. Science 246: 1275-1281) to allow rapid and effective identification of monoclonal F»b 
fragments with the desired specificity for an MOLX protein or derivatives, fragments, analogs or 
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homoiogs thereof- Non-human antibodies can be "humanized" by techniques well known in the 
art. See, e.g., U.S. Patent No. 5,225,539. Antibody fragments that contain the idiotypes to an 
MOLX protein may be produced by techniques known in the art including, but not limited to: (/) 
an F( a y)2 fragment produced by pepsin digestion of an antibody molecule; (ii) an F a b fragment 
5 generated by reducing the disulfide bridges of an F( ab ')2 fragment; (Hi) an F a b fragment generated 
by the treatment of the antibody molecule with papain and a reducing agent; and (iv) F v 
fragments. 

Additionally, recombinant anti-MOLX antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be made 
10 using standard recombinant DNA techniques, are within the scope of the invention. Such 
chimeric and humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in International Application 
No. PCT/US86/02269; European Patent Application No. 1 84,1 87; European Patent Application 
No. 1 7 1 ,496; European Patent Application No. 1 73,494; PCT International Publication No. WO 
1 5 86/01 533; U.S. Patent No. 4,816,567; U.S. Pat. No. 5,225,539; European Patent Application No. 
125,023; Better, et al, 1988. Science 240: 1041-1043; Liu, et al, 1987. Proc. Natl Acad, Set 
USA 84: 3439-3443; Liu, etal, 1987. J. Immunol. 139: 3521-3526; Sun, etal, \9%l.Proc. Natl. 
Acad, Sci. USA 84: 214-218; Nishirnura, etal, 1987. Cancer Res. 41: 999-1005; Wood, etal, 
.1 985. Nature 3 14 :446-449; Shaw, et al, 1988. J. Natl Cancer Inst 80: 1553-1559); 
20 Morrison(1985) Science 229:1202-1207; Oi, et al (1986) BioTechniques 4:214; Jones, et al, 
1986. Nature 321 : 552-525; Vcrhocyan, et al t 1988. Science 239: 1534.; and Beidler, et al. t 
1988. 7. Immunol 141: 4053-4060. Each of me above citations are incorporated herein by 
reference in their entirety. 

In one embodiment, methods for the screening of antibodies that possess the desired 
25 specificity include, but are not limited to, enzyme-linked immunosorbent assay (ELISA) and 
other immunologically-mediated techniques known within the art. In a specific embodiment, 
selection of antibodies that are specific to a particular domain of an MOLX protein is facilitated 
by generation of hybridomas that bind to the fragment of an MOLX protein possessing such a 
domain. Thus, antibodies that are specific for a desired domain within an MOLX protein, or 
30 derivatives, fragments, analogs or homoiogs thereof, are also provided herein. 

Anti-MOLX antibodies may be used in methods known within the art relating to the 
localization and/or quantitation of an MOLX protein (e.g., for use in measuring levels of the 
MOLX protein within appropriate physiological samples, for use in diagnostic methods, for use 
in imaging the protein, and the like). In a given embodiment, antibodies for MOLX proteins, or 
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derivatives, fragments, analogs or hornologs thereof, that contain the antibody derived binding 
domain, are utilized as pharmacologically-active compounds (hereinafter "Therapeutics"). 

An anti-MOLX antibody (e.g., monoclonal antibody) can be used to isolate an MOLX 
polypeptide by standard techniques, such as affinity chromatography or immunoprecipitation. 
An anti-MOLX antibody can facilitate the purification of natural MOLX polypeptide from cells 
and of recombinantly-produced MOLX polypeptide expressed in host cells. Moreover, an 
anti-MOLX antibody can be used to detect MOLX protein (e.g., in a cellular lysate or cell 
supernatant) in order to evaluate the abundance and pattern of expression of the MOLX protein. 
Anti-MOLX antibodies can be used diagnostically to monitor protein levels in tissue as part of a 
clinical testing procedure, e.g., to, for example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling (i.e., physically linking) the antibody to a 
detectable substance. Examples of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, 
P-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes 
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes 
luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin, and 
examples of suitable radioactive material include 125 1, 13 l I, 35 S or 3 H. 

MOLX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an MOLX protein, or derivatives, fragments, analogs or 
hornologs thereof. As used herein, the term 'Vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", 
which refers to a circular double stranded DNA loop into which additional DNA segments can 
be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be 
ligated into the viral genome. Certain vectors are capable of autonomous replication in a host 
cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication 
and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain vectors are capable of directing the 
expression of genes to which they are operatively-linked. Such vectors are referred to herein as 
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"expression vectors". In general, expression vectors of utility in recombinant DNA techniques 
are often in the form of plasmids. In the present specification, "plasmid" and "vector" can be 
used interchangeably as the plasmid is the most commonly used form of vector. However, the 
invention is intended to include such other forms of expression vectors, such as viral vectors 
5 (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve 
equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means that the 
recombinant expression vectors include one or more regulatory sequences, selected on the basis 

10 of the host cells to be used for expression, that is operatively-tinked to the nucleic acid sequence 
to be expressed. Within a recombinant expression vector, "operably-linked" is intended to mean 
that the nucleotide sequence of interest is linked to the regulatory sequence(s) in a manner that 
allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation 
system or in a host cell when the vector is introduced into the host cell). 

15 The term "regulatory sequence" is intended to includes promoters, enhancers and other 

expression control elements (e.g., polyadenylation signals). Such regulatory sequences are 
described, for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN 
ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include 
those that direct constitutive expression of a nucleotide sequence in many types of host cell and 

20 those that direct expression of the nucleotide sequence only in certain host cells {e.g., 

tissue-specific regulatory sequences): It will be appreciated by those skilled in the art that the 
design of the expression vector can depend, on such factors as the choice of the host cell to be 
transformed, the level of expression of protein desired, etc. The expression vectors of the . 
invention can be introduced into host cells to thereby produce proteins or peptides, including 

25 fusion proteins or peptides, encoded by nucleic acids as described herein (e.g. , MOLX proteins, 
mutant forms of MOLX proteins, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression of 
MOLX proteins in prokaryotic or eukaryotic cells. For example, MOLX proteins can be 
expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression 

30 vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Methods in Enzymology 1 85, Academic Press, San Diego, 
Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated 
in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 
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Expression of proteins in prokaiyotcs is most often carried out in Escherichia coli with 
vectors containing constitutive or inducible promoters directing the expression of either fusion or 
non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, 
usually to the amino terminus of the recombinant protein. Such fusion vectors typically serve 
5 three purposes: (/) to increase expression of recombinant protein; (n) to increase the solubility of 
the recombinant protein; and (Hi) to aid in the purification of the recombinant protein by acting 
as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage 
site is introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to purification of the 

10 fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, 
thrombin and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson, 1988. Gene 67: 31-40), pMAL (New England Biolabs, Beverly, 
Mass.) and pRTT5 (Pharmacia, Piscalaway, N J.) that fuse glutathione S-transferase (GST), 
maltose E binding protein, or protein A, respectively, to the target recombinant protein. 

1 5 Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 

(Amrann et al. y (1988) Gene 69:301-31 5) and pET 1 Id (Studier et aL 9 GENE EXPRESSION 
TECHNOLOGY: METHODS INENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990) 
. 60-89). 

One strategy to maximize recombinant protein expression in E. coli is to express the 
20 protein in a host bacteria with an impaired capacity to proteolytically cleave the recombinant 

protein. See, e.g:, Gottestnan, Gene EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 1 85, 
Academic Press, San Diego, Calif. (1 990) 1 19-128. Another strategy is to alter the nucleic acid 
sequence of the nucleic acid to be inserted into an expression vector so that the individual codons 
for each amino acid are those preferentially utilized in E. coli (see, eg., Wada, et al. 9 1992. Nucl. 
25 Acids Res. 20: 21 1 1-2118). Such alteration of nucleic acid sequences of the invention can be 
carried out by standard DNA synthesis techniques. 

In another embodiment, the MOLX expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast Saccharomyces cerivisae include pYepSecl 
(Baldari, etal, 1987. EMBOJ. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell30: 
30 933-943), pJRY88 (Schultz et aU 1987. Gene 54: 1 13-123), pYES2 (Invitrogen Corporation, 
San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.), 

Alternatively, MOLX can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins, in cultured insect cells (e.g. 9 
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SF9 cells) include the pAc series (Smith, et al. 9 1983. Mol Cell Biol. 3: 2156-2165) and the 
pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian 
ceils using a mammalian expression vector. Examples of mammalian expression vectors include 

5 pCDM8 (Seed, 1 987. Nature 329: 840) and pMT2PC (Kaufman, et al, 1 987. EMBOJ. 6: 
187-195). When used in mammalian cells, the expression vector's control functions are often 
provided by viral regulatory elements. For example, commonly used promoters are derived from 
polyoma, adenovirus 2, cytomegalovirus, and simian virus 40. For other suitable expression 
systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of Sambrook, et 

10 a/. s Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. 

In another embodiment, the recombinant mammalian expression vector is capable of 
directing expression of the nucleic acid preferentially in a particular cell type (e.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 

1 5 regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific 
promoters include the albumin promoter (liver-specific; Pinkert, et aL, 1987. Genes Dev. 1 : 
268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol 43: 235-275), 
in particular promoters of T cell receptors (Winoto and Baltimore, 1 989. EMBO 1 8 : 729-733) 
and immunoglobulins (Banerji, et a/., 1 983, Cell 33: 729-740; Queen and Baltimore, 1983. Cell 

20 33 : 741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle, 
1989. Proc. Natl Acad. Set USA 86: 5473-5477)* pancreas-specific promoters (Edlund, et al 9 
1985*. Science 230: 912-91 6), and mammary gland-specific promoters (e.g., milk whey promoter, 
U.S. Pat. No. 4,873*316 and European Application Publication No. 264,166). Developmehtally- 
regulated promoters are also encompassed, e.g. , the murine hox promoters (Kessel and Gruss, 

25 1990, Science 249: 374-379) and the a-fetoprotein promoter (Campes and Tilghman, 1?89. 
Genes Dev. 3: 537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That is, 
the DNA molecule is operatively-linked to a regulatory sequence in a manner that allows for 

30 expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to 
MOLX mRN A. Regulatory sequences operatively linked to a nucleic acid cloned in the 
antisense orientation can be chosen that direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen that direct constitutive, tissue specific or cell type specific expression of 

111 



BNSOOCID: <WO 0161576A2_I_> 



WO 01/8157X 



PCT/US0 1/1 3578 



antisense RNA. The antisensc expression vector can be in the form of a recombinant plasmid, 
phagemid or attenuated vims in which antisense nucleic acids are produced under the control of 
a high efficiency regulatory region, the activity of which can be determined by the cell type into 
which the vector is introduced. For a discussion of the regulation of gene expression using 
5 antisense genes see, e.g., Weintraub, et a!., "Antisense RNA as a molecular tool for genetic 
analysis," Reviews-Trends in Genetics, Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and "recombinant 
host cell" are used interchangeably herein. It is understood that such terms refer not only to the 

10 particular subject cell but also to the progeny or potential progeny of such a cell. Because certain 
modifications may occur in succeeding generations due to either mutation or environmental 
influences, such progeny may not, in fact, be identical to the parent cell, but are still included 
within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, MOTJX protein can 

1 5 be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those 
skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
transformation or transfection techniques. As used herein, the terms "transformation' 1 and 

20 "transfection" are intended to refer to a variety of art-recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium 
chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation. 
Suitable methods for transforming of transferring host cells can be found id Sambrook, et al. 
(Molecular Cloning: A Laboratory Manual - 2nd ed., Cold Spring Harbor Laboratory, . 

25 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory 
manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may integrate 
the foreign DNA into their genome. In order to identify and select these integrants, a gene that 
30 encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host 
cells along with the gene of interest. Various selectable markers include those that confer 
resistance to drugs, such as G41 8, hygromycin and methotrexate. Nucleic acid encoding a 
selectable marker can be introduced into a host cell on the same vector as that encoding MOLX 
or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic 
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acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker 
gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or cukaryotic host cell in culture, can 
be used to produce (i.e., express) MOLX protein. Accordingly, the invention further provides 
5 methods for producing MOLX protein using the host cells of the invention. In one embodiment, 
the method comprises culturing the host cell of invention (into which a recombinant expression 
vector encoding MOLX protein has been introduced) in a suitable medium such that MOLX 
protein is produced. In another embodiment, the method further comprises isolating MOLX 
protein from the medium or the host cell. 

1 0 Transgenic MOLX Animals 

The host cells of the invention can also be used to produce non-human transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte or an 
embryonic stem cell into which MOLX protein-coding sequences have been introduced. Such 
host cells can then be used to create non-human transgenic animals in which exogenous MOLX 

1 5 sequences have been introduced into their genome or homologous recombinant animals in which 
endogenous MOLX sequences have been altered. Such animals are useful for studying the 
function and/or activity of MOLX protein and for identifying and/or evaluating modulators of 
MOLX protein activity. As used herein, a "transgenic animal" is a non-human animal, 
preferably a mammal, more preferably a rodent such as a rat or mouse, in which one or more of 

20 the cells of the animal includes a transgene. Other examples of transgenic animals include 
non-human primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is 
exogenous DNA that is integrated into the genome of a cell from which a transgenic animal 
develops and. that remains in the genome of the mature animal, thereby directing the expression 
of an encoded gene product in one or more cell types or tissues of the transgenic animal. As 

25 used herein, a "homologous recombinant animal" is a non-human animal, preferably a mammal, 
more preferably a mouse, in which an endogenous MOLX gene has been altered by homologous 
recombination between the endogenous gene and an exogenous DNA molecule introduced into a 
cell of the animal, e.g. 9 an embryonic cell of the animal, prior to development of the animal. 
A transgenic animal of the invention can be created by introducing MOLX-encoding 

30 nucleic acid into the male pronuclei of a fertilized oocyte (e g., by microinjection, retroviral 
infection) and allowing the oocyte to develop in a pseudopregnant female foster animal. The 
human MOLX cDNA sequences of SEQ IDNOSil, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
and 29 can be introduced as a transgene into the genome of a non-human animal. Alternatively, 
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a non-human homologue of the human MOLX gene, such as a mouse MOLX gene, can be 
isolated based on hybridization to the human MOLX cDNA (described further supra) and used 
as a transgene. Intronic sequences and polyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the transgene. A tissue-specific regulatory 
5 sequence(s) can be operably-linked to the MOLX transgene to direct expression of MOLX 

protein to particular cells. Methods for generating transgenic animals via embryo manipulation 
and microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan, 
1 986. In: MANIPULATING THE MOUSE EMBRYO, Cold Spring Harbor Laboratory Press, Cold 

10 Spring Harbor, N.Y. Similar methods are used for production of other transgenic animals. A 
transgenic founder animal can be identified based upon the presence of the MOLX transgene in 
its genome and/or expression of MOLX mRNA in tissues or cells of the animals. A transgenic 
founder animal can then be used to breed additional animals carrying the transgene. Moreover, 
transgenic animals carrying a transgene-encoding MOLX protein can further be bred to other 

15 transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains at least 
a portion of an MOLX gene into which a deletion, addition or substitution has been introduced to 
thereby alter, eg., functionally disrupt, the MOLX gene. The MOLX gene can be a human gene 
(eg., the cDNA of SEQ ID NOS:l, 3, 5, 7,9, 11, 13, 15, 17, 19, 21, 23, 25, 27 r and29), but 

20 more preferably, is a non-human homologue of a human MOLX gene. For example, a mouse 
homologue of human MOLX gene of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 
27, and 29 can be used to construct a homologous recombination vector suitable for altering an 
endogenous MOLX gene in the mouse genome. In one embodiment, the vector is designed such 
that, upon homologous recombination, the endogenous MOLX gene is functionally disrupted 

25 (i.e., no longer encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous recombination, the 
endogenous MOLX gene is mutated or otherwise altered but still encodes functional protein 
(e.g. , the upstream regulatory region can be altered to thereby alter the expression of the 
endogenous MOLX protein). In the homologous recombination vector, the altered portion of the 

30 MOLX gene is flanked at its 5 f - and 3 ! -termini by additional nucleic acid of the MOLX gene to 
allow for homologous recombination to occur between the exogenous MOLX gene carried by 
the vector and an endogenous MOLX gene in an embryonic stem cell. The additional flanking 
MOLX nucleic acid is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking QNA (both at the 5- and S'-tennini) 
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are included in the vector. See, e.g., Thomas, et a/., 1987. Cell 51: 503 for a description of 
homologous recombination vectors. The vector is ten introduced into an embryonic stem cell 
line (e.g., by electroporation) and cells in which the introduced MOLX gene has homologously- 
recombined with the endogenous MOLX gene are selected. See, e.g., Li, et al, 1992. Cell 69: 
5 915. 

The selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to form 
aggregation chimeras. See, e.g., Bradley, 1987. In: TERATOCARCiNOMAS AND EMBRYONIC STEM 
CELLS: A PRACTICAL APPROACH, Robertson, ed. IRL, Oxford, pp. 1 13-152. A chimeric embryo 
can then be implanted into a suitable pseudopregnant female foster animal and the embryo 

1 0 brought to term. Progeny harboring the hornologously-rccombined DNA in their germ cells can 
be used to breed animals in which all cells of the animal contain the homologously-recombincd 
DNA by germline transmission of the transgene. Methods for constructing homologous 
recombination vectors and homologous recombinant animals are described further in Bradley, 
1991. Curr. Opin. Biotechnol. 2: 823-829; PCT International Publication Nos.: WO 90/11354; 

1 5 WO 91/01 140; WO 92/0968; and WO 93/04169. 

In another embodiment, transgenic non-humans animals can be produced that contain 
selected systems that allow for regulated expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of bacteriophage PI. For a description of the 
cre/loxP recombinase system, See, e.g., Lakso, et al, 1992. Proc. Natl, Acad. ScL USA 89: 

20 6232-6236. Another example of a recombinase system is the FLP recombinase system of 

Saccharomyces cerevisiae. See t O'Gorman, etal, 1991. Science 251:1351-1355. If a cre/loxP 
recombinase system is used to regulate expression of the transgene, animals containing 
transgenes encodingboth the Cre recombinase and a selected protein are required. Such animals 
can be provided through the construction of "double" transgenic animals, e.g. , by mating two 

25 transgenic animals, one containing a transgene encoding a selected protein and the other 
containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, et al, 1997. Nature 385: 810-813. In brief, a cell 
(e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit the growth 

30 cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the same species from which the quiescent cell 
is isolated. The reconstructed oocyte is then cultured such that it develops to morula or 
blastocyte and then transferred to pseudopregnant female foster animal. The offspring borne of 
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this female foster animal will be a clone of the animal from which the cell {e.g., the somatic cell) 
is isolated. 

Pharmaceutical Compositions 

The MOLX nucleic acid molecules, MOLX proteins, and anti-MOLX antibodies (also 
5 referred to herein as "active compounds") of the invention, and derivatives, fragments, analogs 
and homologs thereof, can be incorporated into pharmaceutical compositions suitable for 
administration. Such compositions typically comprise the nucleic acid molecule, protein, or 
antibody and a pharmaceutically acceptable carrier. As used herein, fl phaTmaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, 

10 antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. Suitable carriers are described in the most 
recent edition of Remington's Pharmaceutical Sciences, a standard reference text in the field, 
which is incorporated herein by reference. Preferred examples of such carriers or diluents 
include, but are not limited to, water, saline, finger's solutions, dextrose solution, and 5% human 

15 scrum albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be used. The 
use of such media and agents for pharmaceutically active substances is well known in the art. 
Except insofar as any conventional media or agent is incompatible with the active compound, 
use thereof in the compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

20 A pharmaceutical composition of the invention is formulated to be compatible with its 

intended route of administration. Examples of routes of administration include parenteral, e.g., 
intravenous, intra^ . 
transmucosal, and rectal administration. Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the following components: a sterile diluent 

25 such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene 
glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl 
parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, and 
agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be 

30 adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of 
glass or plastic. 
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Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of 
sterile injectable solutions or dispersion. For intravenous administration, suitable carriers 
include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ.) or 

5 phosphate buffered saline (PBS). In all cases, the composition must be sterile and should be 
fluid to the extent that easy syringeability exists. It must be stable under the conditions of 
manufacture and storage and must be preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium 
containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and 

1 0 liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can 
be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the 
required particle size in the case of dispersion and by the use of surfactants. Prevention of the 
action of microorganisms can be achieved by various antibacterial and antifungal agents, for 
example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many 

1 5 cases, it will be preferable to include isotonic agents, for example, sugars, polyalcohols such as 
manitol, sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the composition an agent which delays 
absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound (e,g., 

20 an MOLX protein or anti-MOLX antibody) in the required amount in an appropriate solvent with 
one or a combination. of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound into a 
sterile vehicle that contains a basic dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders for the preparation of sterile injectable 

25 solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of 
the active ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can be 
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral-therapeutic- — 
30 administration, the active compound can be incorporated with excipients and used in the form of 
tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier for use 
as a mouthwash, wherein the compound in the fluid carrier is applied orally and swished and 
expectorated or swallowed. Pharmaceutical^ compatible binding agents, and/or adjuvant 
materials can be included as piart of the composition. The tablets, pills, capsules, troches and the 
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like can contain any of the following ingredients, or compounds of a similar nature: a binder 
such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel, or com starch; a lubricant such as 
magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent 
5 such as sucrose or saccharin; or a flavoring agent such as peppermint, methyl salicylate, or 
orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an aerosol 
spray from pressured container or dispenser which contains a suitable propellant, e.g.> a gas such 
as carbon dioxide, or a nebulizer. 

1 0 Systemic administration can also be by transmucosal or transdermal means. For 

transmucosal or transdermal administration, penetrants appropriate to the barrier to be pemieated 
are used in the formulation. Such penetrants are generally known in the art, and include, for 
example, for transmucosal administration, detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through the use of nasal sprays or 

1 5 suppositories. For transdermal adniinistration, the active compounds aTe formulated into 
ointments, salves, gels, or creams as generally known in the art. 

The compounds can also be prepared in the form of suppositories (e.g, , with conventional 
suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal 
delivery. 

20 In one embodiment, the active compounds are prepared with carriers that will protect the 

compound against rapid elimination from the body, such as a controlled release formulation, 
including implants and microencapsulated delivery systems. Biodegradable, biocompatible 
polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, 
collagen, polyorthoesters, and polylactic acid . Methods forpreparation of such formulations will 

25 be apparent to those skilled in the art. The materials can also be obtained commercially from 
Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes 
targeted to infected cells with monoclonal antibodies to viral antigens) can also be used as 
phannaceutically acceptable carriers. These can be prepared according to methods known to 
those skilled in the art, for example, as described in U.S. Patent No. 4,522,81 1. 

30 It is especially advantageous to formulate oral or parenteral compositions in dosage unit 

form for ease of administration and uniformity of dosage. Dosage unit form as used herein refers 
to physically discrete units suited as unitary dosages for the subject to be treated; each unit 
containing a predetermined quantity of active compound calculated to produce, the desired 
therapeutic effect in association with the required pharmaceutical carrier. The specification for 
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the dosage unit foims ofthe invention are dictated by and directly dependent on the unique 
characteristics ofthe active compound and the particular therapeutic effect to be achieved, and 
the limitations inherent in the art of compounding such an active compound for the treatment of 
individuals. 

5 The nucleic acid molecules of the invention can be inserted into vectors and used as gene 

therapy vectors. Gene therapy vectors can be delivered to a subject by, for example, intravenous 
injection, local administration (see, e.g, U.S. Patent No. 5,328,470) or by stereotactic injection 
(see, e.g., Chen, et al, 1994. Proc. Natl Acad. Set USA 91: 3054-3057). The pharmaceutical 
preparation ofthe gene therapy vector can include the gene therapy vector in an acceptable 

1 0 diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector can be produced intact from recombinant 
cells, e.g., retroviral vectors, the pharmaceutical preparation can include one or more cells that 
produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or dispenser 

1 5 together with instructions for administration. 

Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express MOLX 
protein (e.g. , via a recombinant expression vector in a host cell in gene therapy applications), to 
detect MOLX mRNA (e.g., in a biological sample) or a genetic lesion in an MOLX gene,. and to 

20 modulate MOLX activity, as described further, below. In addition, the MOLX proteins can be 
used to screen drugs or compounds that modulate the MOLX protein activity or expression as 
well as to treat disorders characterized by insufficient or excessive production of MOLX protein 
or production of MOLX protein forms that have decreased or aberrant activity compared to 
MOLX wild-type protein (e.g.\ diabetes (regulates insulin release); obesity (binds and transport 

25 lipids); metabolic disturbances associated with obesity, the metabolic syndrome X as well as 
anorexia and wasting disorders associated with chronic diseases and various cancers, and 
infectious disease(possesses anti-microbial activity) and the various dyslipidemias. In addition, 
the anti-MOLX antibodies of the invention can be used to detect and isolate MOLX proteins and 
modulate MOLX activity. In yet a further aspect, the invention can be used in methods to 

30 influence appetite, absorption of nutrients and the disposition of metabolic substrates in both a 
positive and negative fashion. 

The invention further pertains to novel agents identified by the screening assays 
described herein and uses thereof for treatments as described, supra, 
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Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying modulators, i.e., candidate or test compounds or agents (e.g. , peptides, 
5 peptidomirnetics, small molecules or other drugs) that bind to MOLX proteins or have a 

stimulatory or inhibitory effect on, e.g., MOLX protein expression or MOLX protein activity. 
The invention also includes compounds identified in the screening assa3's described herein. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the rnembrane-bound form of an MOLX 

1 0 protein or polypeptide or biologically-active portion thereof. The test compounds of the 
invention can be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; spatially addressable parallel solid 
phase or solution phase libraries; synthetic library methods requiring deconvolution; the 
"one-bead one-compound" library method; and synthetic library methods using affinity 

1 5 chromatography selection. The biological library approach is limited to peptide libraries, while 
the other four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds. See, e.g., Lam, 1997 . Anticancer Drug Desigti 12: 145. 

A "small molecule" as used herein, is meant to refer to a composition that has a 
molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 

20 molecules can be, e.g. , nucleic acids, peptides, polypeptides, peptidornimetics, carbohydrates, 

lipids or other organic or inorganic molecules. Libraries of chemical and/or biological mixtures, . 
such as fungal* bacterial, or algal extracts, are known in the art and can be screened with any of 
the assays of the invention. 

Examples of methods for the synthesis of molecular libraries can be found in the art, for 

25 example in: DeVVitt, etal, 1993. Proc. Natl Acad. Sci. U.S.A. 90: 6909; Erb, et al., 1994. Proc. 
Natl Acad. Sci. U.S.A. 91: 1 1422; Zuckermann, etal, 1994. /. Med. Chem. 37: 2678; Cho, etal, 
1 993. Science 261 : 1303; Carrell, et al. y 1994. Angew. Chem. Int. Ed Engl. 33: 2059; Carell, et 
ai, 1994. Angew. Chem. Int. Ed. Engl. 33: 2061 ; and Gallop, et al., 1994. /. Med Chem. 37: 
1233. 

30 Libraries of compounds may be presented in solution (e.g. , Houghten, 1 992. 

Biotechniques 13:412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips (Fodor, 1993. 
Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores (Ladncr, U.S. Patent 
. 5,233,409), plasmids (Cull, et al., 1992. Proc. Natl Acad. Sci. USA 89: 1865-1869) or on phage 
(Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. Science 249: 404-406; Cwirla, et 
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al t 1990. Proc. Natl. Acad Sci. U.S.A. 87: 6378-6382; Felici, 1991. J. Mol Biol 222: 301-310; 
Ladner, U.S. Patent No. 5,233,409.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses a 
membrane-bound form of MOLX protein, or a biologically-active portion thereof, on the cell 
5 surface is contacted with a test compound and the ability of the test compound to bind to an 
MOLX protein determined. The cell, for example, can of mammalian origin or a yeast cell. 
Determining the ability of the test compound to bind to the MOLX protein can be accomplished, 
for example, by coupling the test compound with a radioisotope or enzymatic label such that 
binding of the test compound to the MOLX protein or biologically-active portion thereof can be 

10 determined by detecting the labeled compound in a complex. For example, test compounds can 
be labeled with 125 1, 35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope detected by 
direct counting of radioemission or by scintillation counting. Alternatively, test compounds can 
be enzymatically-labeled with, for example, horseradish peroxidase, alkaline phosphatase, or 
luciferase, and the enzymatic label detected by determination of conversion of an appropriate 

15 substrate to product. In one embodiment, the assay comprises contacting a cell which expresses 
a membrane-bound form of MOLX protein, or a biologically-active portion thereof, on the cell 
surface with a known compound which binds MOLX to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to interact 
with an MOLX protein, wherein determining the ability of the test compound to. interact, with an 

20 MOLX protein comprises determining the ability of the test compound to preferentially bind to 
MOLX protein or a biologically-active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of MOLX protein, or a biologically-active portion thereof, 
on the cell surface with a test compound and determining the ability of the test compound to 

25 modulate {e.g. , stimulate or inhibit) the activity of the MOLX protein or biologically-active 

portion thereof Determining the ability of the test compound to modulate the activity of MOLX 
or a biologically-active portion thereof can be accomplished, for example, by determining the 
ability of the MOLX protein to bind to or interact with an MOLX target molecule. As used 
herein, a "target molecule" is a molecule with which an MOLX protein binds or interacts in 

30 nature, for example, a molecule on the surface of a cell which expresses an MOLX interacting 
protein, a molecule on the surface of a second cell, a molecule in the extracellular milieu, a 
molecule associated with the internal surface of a cell membrane or a cytoplasmic molecule. An 
MOLX target molecule can be a non-MOLX molecule or an MOLX protein or polypeptide of 
the invention. In one embodiment, an MOLX target molecule is a component of a signal 

121 



BNSOOCID: <WO 0181678A2j_> 



WO 01/81578 



PCT/US01/13578 



transduction pathway that facilitates transduction of an extracellular signal (e.g. a signal 
generated by binding of a compound to a membrane-bound MOLX molecule) through the cell 
membrane and into the cell. The target, for example, can be a second intercellular protein that 
has catalytic activity or a protein that facilitates the association of downstream signaling 
5 molecules with MOLX. 

Determining the ability of the MOLX protein to bind to or interact with an MOLX target 
molecule can be accomplished by one of the methods described above for determining direct 
binding. In one embodiment, determining the ability of the MOLX protein to bind to or interact 
with an MOLX target molecule can be accomplished by determining the activity of the target 

10 molecule. For example, the activity of the target molecule can be determined by detecting 

induction of a cellular second messenger of the target (i.e. intracellular Ca 2+ , diacylglycerol, IP 3 , 
etc.), detecting catalytic/enzymatic activity of the target an appropriate substrate, detecting the 
induction of a reporter gene (comprising an MOLX-responsive regulatory element operatively 
linked to a nucleic acid encoding a detectable marker, e.g., luciferase), or detecting a cellular 

1 5 response, for example, cell survival, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the invention is a cell-free assay comprising 
contacting an MOLX protein or biologically-active portion thereof with a test compound and 
determining the ability of the test compound to bind to the MOLX protein or biologically-active 
portion thereof. Binding of the test compound to the MOLX protein can be determined either 

20 directly or indirectly as described above. In one such embodiment, the assay comprises 

contacting the MOLX protein or biologically-active portion thereof with a known compound 
which binds MOLX to form an assay mixture, contacting the assay mixture with a test 
compound, and detennining the ability of the test compound to interact with an MOLX protein, 
wherein determining the ability of the test compound to interact with an MOLX protein 

25 comprises determining the ability of the test compound to preferentially bind to MOLX or 
biologically-active portion thereof as compared to the known compound. 

In still another embodiment, an assay is a cell-free assay comprising contacting MOLX 
protein or biologically-active portion thereof with a test compound and determining the ability of 
the test compound to modulate (e.g. stimulate or inhibit) the activity of the MOLX protein or 

30 biologically-active portion thereof. Detennining the ability of the test compound to modulate the 
activity of MOLX can be accomplished, for example, by determining the ability of the MOLX 
protein to bind to an MOLX target molecule by one of the methods described above for 
determining direct binding. In an alternative' embodiment, determining the ability of the test 
compound to modulate the activity of MOLX protein can be accomplished by detennining the 
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ability of the MOLX protein farther modulate an MOLX target molecule. For example, the 
catalytic/enzymatic activity of the target molecule on an appropriate substrate can be determined 
as described, supra. 

In yet another embodiment, the cell-free assay comprises contacting the MOLX protein 
5 or biologically-active portion thereof with a known compound which binds MOLX protein to 
form an assay mixture, contacting the assay mixture with a test compound, and determining the 
ability of the test compound to interact with an MOLX protein, wherein determining the ability 
of the test compound to interact with an MOLX protein comprises determining the ability of the 
MOLX protein to preferentially bind to or modulate the activity of an MOLX target molecule. 

10 The cell-free assays of the invention are amenable to use of both the soluble form or the 

membrane-bound form of MOLX protein. In the case of cell-free assays comprising the 
membrane-bound form of MOLX protein, it may be desirable to utilize a solubilizing agent such 
that the membrane-bound form of MOLX protein is maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as n-octylglucoside, n-dodccylglucosidc, 

1 5 n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton® 
X-100, Triton® X-114, Thesit® Isotridecypoly(ethylene glycol ether)„, N-dodecyl- 
N,N-dimethyl-3-ammonio-l-pTopane sulfonate, 3-(3-cholamidopropyl) dimethylamminiol- 
1 -propane sulfonate (CHAPS), or 3-(3-cholamidopropyl)dimethylammmiol-2-hydroxy- 
1 -propane sulfonate (CHAPSO). 

20 In more than one embodiment of the above assay methodsr of the invention, it may be 

desirable to immobilize either MOLX protein or its target molecule to facilitate separation of 
complexed from uncomplexed forms of one of both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to MOLX protein, or interaction of MOLX 
protein with a target molecule in the presence and abiserice of a candidate compound, can be 

25 accomplished in any vessel suitable for containing the reactants. Examples of such vessels 

include microtiter plates, test tubes, and micro-centrifuge tubes. In one embodiment, a fusion 
protein can be provided that adds a domain that allows one or both of the proteins to be bound to 
a matrix. For example, GST-MOLX fusion proteins or GST-target fusion proteins can be 
adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 

30 derivatized microtiter plates, that are then combined with the test compound or the test 

compound and either the non-adsorbed target protein or MOLX protein, and the mixture is 
incubated under conditions conducive to complex formation (e.g., at physiological conditions for 
salt and pH). Following incubation, the beads or microtiter plate wells are washed to remove 
any unbound components, the matrix immobilized in the case of beads, complex determined 
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either directly or indirectly, for example, as described, supra. Alternatively, the complexes can 
be dissociated from the matrix, and the level of MOLX protein binding or activity determined 
using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the screening 
5 assays of the invention. For example, either the MOLX protein or its target molecule can be 
immobilized utilizing conjugation of biotin and streptavidin. Biotinylated MOLX protein or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using techniques 
well-known within the art (e.g. , biotinylation kit, Pierce Chemicals, Rockford, III), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, 

1 0 antibodies reactive with MOLX protein or target molecules, but which do not interfere with 

binding of the MOLX protein to its target molecule, can be derivatized to the wells of the plate, 
and unbound taTget or MOLX protein trapped in the wells by antibody conjugation. Methods for 
detecting such complexes, in addition to those described above for the GST-immobilized 
complexes, include immunodetection of complexes using antibodies reactive with the MOLX 

1 5 protein or target molecule, as well as enzyme-linked assays that rely on detecting an enzymatic 
activity associated with the MOLX protein or target molecule. 

In another embodiment, modulators of MOLX protein expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of MOLX 
mRNA or protein in the cell is determined. The level of expression of MOLX mRNA or protein 

20 in the presence of the candidate compound is compared to the level of expression of MOLX 

mRNA or protein in the absence of the candidate compound. The candidate compound can then 
be identified as a modulator of MOLX mRNA or protein expression based upon this comparison. 
For example, when expression of MOLX mRNA or protein is greater (/. e. , statistically 
significantly greater) in the presence of the candidate compound than in its absence, the 

25 candidate compound is identified as a stimulator of MOLX mRNA or protein expression. 

Alternatively, when expression of MOLX mRNA or protein is less (statistically significantly 
less) in the presence of the candidate compound than in its absence, the candidate compound is 
identified as an inhibitor of MOLX mRNA or protein expression. The level of MOLX mRNA or 
protein expression in the cells can be determined by methods described herein for detecting 

30 MOLX mRNA or protein. 

In yet another aspect of the invention, the MOLX proteins can be used as "bait proteins" 
in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos, et al. y 
1993. Cellll: 223-232; Madura, et aL, 1993. /. Biol. Chem. 268: 12046-12054; Bartel, etaL> 
1993. Biotechniques 14: 920-924; Iwabuchi, et al, 1993. Oncogene 8: 1693^1696; and Brent 
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WO 94/10300), to identify other proteins that bind to or interact with MOLX ("MOLX-binding 
proteins" or "MOLX-bp") and modulate MOLX activity. Such MOLX-binding proteins are also 
likely to be involved in the propagation of signals by the MOLX proteins as, for example, 
upstream or downstream elements of the MOLX pathway. 

5 The two-hybrid system is based on the modular nature of most transcription factors, 

which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes two 
different DNA constructs. In one construct, the gene that codes for MOLX is fused to a gene 
encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In the other 
construct, a DNA sequence, from a library of DNA sequences, that encodes an unidentified 

10 protein ("prey" or "sample") is fused to a gene that codes for the activation domain of the known 
transcription factor. If the "bait" and the "prey" proteins are able to interact, in vivo, forming an 
MOLX-dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene (e.g., 
LacZ) that is operably linked to a transcriptional regulatory site responsive to the transcription 

1 5 factor. Expression of the reporter gene can be detected and cell colonics containing the 

functional transcription factor can be isolated and used to obtain the cloned gene that encodes the 
protein which interacts with MOLX 

The invention further pertains to novel agents identified by the aforementioned screening 
assays and uses thereof foT treatments as described herein. 

20 Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the corresponding 
complete gene sequences) can be used in numerous ways as polynucleotide reagents. By way of 
example, and not of limitation, these sequences can be used .to: (z) map their respective genes on 
a chromosome; and, thus, locate gene regions associated with genetic disease; (fi) identify an 
25 individual from a minute biological sample (tissue, typing); and (wi) aid in forensic identification 
of a biological sample. Some of these applications are described in the subsections, below. 

Chromosome Mapping 

Once the sequence (or a portion of the sequence) of a gene has been isolated, this 
30 sequence can be used to map the location of the gene on a chromosome. This process is called 
chromosome mapping. Accordingly, portions or fragments of the MOLX sequences, SEQ ID 
NOS:i,3,5,7, 9, II, 13, 15, 17,19,21,23, 25, 27, and 29, or fragments or derivatives thereof, 
can be used to map the location of the MOLX genes, respectively, on a chromosome. The 
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mapping of the MOLX sequences to chromosomes is an important first step in correlating these 
sequences with genes associated with disease. 

Briefly, MOLX genes can be mapped to chromosomes by preparing PCR primers 
(preferably 1 5-25 bp in length) from the MOLX sequences. Computer analysis of the MOLX, 
sequences can be used to rapidly select primers that do not span more than one exon in the 
genomic DNA, thus complicating the amplification process. These primers can then be used for 
PCR screening of somatic cell hybrids containing individual human chromosomes. Only those 
hybrids containing the human gene corresponding to the MOLX sequences will yield an 
amplified fragment 

Somatic cell hybrids are prepared by fusing somatic cells from different mammals (e.g., 
human and mouse cells). As hybrids of human and mouse cells grow and divide, they gradually 
lose human chromosomes in random order, but retain the mouse chromosomes. By using media 
in which mouse cells cannot grow, because they lack a particular enzyme, but in which human 
cells can, the one human chromosome that contains the gene encoding the needed enzyme will 
be retained. By using various media, panels of hybrid cell lines can be established. Each cell 
line in a panel contains either a single human chromosome or a small number of human 
chromosomes, and a full set of mouse chromosomes, allowing easy mapping of individual genes 
to specific human chromosomes. See, e.g., DTEustachio, el ah, 1983. Science 220: 919-924. 
Somatic cell hybrids containing only fragments of human chromosomes can also be produced by 
using human chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
sequence to a particular chromosome. Three or more sequences can be assigned per day using a 
single thermal cycler. Using.the MOLX sequences to design oligonucleotide primers, sub- 
localization can be achieved with panels of fragments from specific chromosomes. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in one step. 
Chromosome spreads can be made using cells whose division has been blocked in metaphase by 
a chemical like colcemid that disrupts the mitotic spindle. The chromosomes can be treated 
briefly with trypsin, and then stained with Giemsa. A pattern of light and dark bands develops 
on each chromosome, so that the chromosomes can be identified individually. The FISH 
technique can be used with a DNA sequence as short as 500 or 600 bases. However, clones 
larger than 1,000 bases have a higher likelihood of binding to a unique chromosomal location 
with sufficient signal intensity for simple detection. Preferably 1,000 bases, and more preferably 
2,000 bases, will suffice to get good results at a reasonable amount of time. For a review of this 
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technique, see, Verma, ei al. y HUMAN CHROMOSOMES: A MANUAL OF BASIC TECHNIQUES 
(Pergamon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for marking 
5 multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding regions of 
the genes actually are preferred for mapping purposes. Coding sequences are more likely to be 
conserved within gene families, thus increasing the chance of cross hybridizations during 
chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the physical 

10 position of the sequence on the chromosome can be correlated with genetic map data. Such data 
are found, e.g., in McKusick, MENDELIAN INHERITANCE IN MAN, available on-line through Johns 
Hopkins University Welch Medical Library). The relationship between genes and disease, 
mapped to the same chromosomal region, can then be identified through linkage analysis 
(co-inheritance of physically adjacent genes), described in, e.g., Egeland, et al, 1987. Nature, 

15 325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the MOLX gene, can be determined. If a mutation is 
observed in some or all of the affected individuals but not in any unaffected individuals, then the 
mutation is likely to be the causative agent of the particular disease. Comparison of affected and 

20 unaffected individuals generally involves first looking for struck 

chromosomes, such as deletions or translocations that are visible from chromosome spreads or 
detectable using PCR based on that DNA sequence. Ultimately, complete sequencing of genes 
from* several individuals can be performed to confirm the presence of a mutation and to 
distinguish mutations from polymorphisms. 

25 

Tissue Typing 

The MOLX sequences of the invention can also be used to identify individuals from 
minute biological samples. In this technique, an individual's genomic DNA is digested with one 
or more restriction enzymes, and probed on a Southern blot to yield unique bands for 
30 identification. The sequences of the invention are useful as additional DNA markers for RFLP 
("restriction fragment length polymorphisms," described in U.S. Patent No. 5,272,057). 

Furthermore, the sequences of the invention can be used to provide an alternative 
technique that determines the actual base-by-basc DNA sequence of selected portions of an 
individual's genome. Thus, the MOLX sequences described herein can be used to prepare two 
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PCR primers from the 5'- and 3 ! -tenxini of the sequences. These primers can then be used to 
amplify an individual's DNA and subsequently sequence it 

Panels of corresponding DNA sequences from individuals, prepared in this manner, can 
provide unique individual identifications, as each individual will have a unique set of such DNA 
5 sequences due to allelic differences. The sequences of the invention can be used to obtain such 
identification sequences from individuals and from tissue. The MOLX sequences of the 
invention uniquely represent portions of the human genome. Allelic variation occurs to some 
degree in the coding regions of these sequences, and to a greater degree in the noncoding 
regions. It is estimated that allelic variation between individual humans occurs with a frequency 

10 of about once per each 500 bases. Much of the allelic variation is due to single nucleotide 
polymorphisms (SNPs), which include restriction fragment length polymorphisms (RFLPs). 

Each of the sequences described herein can, to some degree, be used as a standard against 
which DNA from an individual can be compared for identification purposes. Because greater 
numbers of polymorphisms occur in the noncoding regions, fewer sequences are necessary to 

15 differentiate individuals. The noncoding sequences can comfortably provide positive individual 
identification with a panel of perhaps 10 to 1,000 primers that each yield a noncoding amplified 
sequence of 1 00 bases, If predicted coding sequences, such as those in SEQ ID NOS:l, 3, 5, 7, 
9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29 are used, a more appropriate number of primers for 
positive individual identification would be 500-2,000. 

20 

Predictive Medicine 

The invention also pertains to the field of predictive medicine in which diagnostic assays, 
. prognostic assays* pharmacogenomics, and monitoring. clinical trials are used for prognostic 
(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one aspect of 

25 the invention relates to diagnostic assays for determining MOLX protein and/or nucleic acid 
expression as well as MOLX activity, in the context of a biological sample (e.g., blood, serum, 
cells, tissue) to thereby determine whether an individual is afflicted with a disease or disorder, or 
is at risk of developing a disorder, associated with aberrant MOLX expression or activity. The 
disorders include metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer- 

30 associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's 
Disorder, immune disorders, and hematopoietic disorders, and the various dyslipidemias, 
metabolic disturbances associated with obesity, the metabolic syndrome X and wasting 
.disorders.associated with chronic diseases and various cancers. The invention also provides for 
prognostic (or predictive) assays for determining whether an individual is at risk of developing q 
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disorder associated with MOLX protein, nucleic acid expression or activity. For example, 
mutations in an MOLX gene can be assayed in a biological sample. Such assays can be used for 
prognostic or predictive purpose to thereby prophylactically treat an individual prior to the onset 
of a disorder characterized by or associated with MOLX protein, nucleic acid expression, or 
5 biological activity. 

Another aspect of the invention provides methods for determining MOLX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows foT the selection of agents (e.g., drugs) for therapeutic or prophylactic 
1 0 treatment of an individual based on the genotype of the individual (e.g. , the genotype of the 
individual examined to determine the ability of the individual to respond to a particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g., 
drugs, compounds) on the expression or activity of MOLX in clinical trials. 

These and other agents are described in further detail in the following sections. 

15 

Diagnostic Assays 

An exemplary method for detecting the presence or absence of MQLX in a biological 
sample involves obtaining a biological sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting MOLX protein or nucleic acid (e.g. , 

20 mRNA, genomic DN A) that encodes MOLX protein such that the presence of MOLX is detected 
in the biological sample. An agent for detecting MOLX mRNA or genomic DNA is a labeled 
nucleic acid probe capable of hybridizing to MOLX mRNA or genomic DNA. The nucleic acid 
probe can be, for example, a full-length MOLX nucleic acid, such as the nucleic acid of SEQ ID 
NGS:1, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or aportion thereof, such as an 

25 oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to 
specifically hybridize under stringent conditions to MOLX mRNA or genomic DNA. Other 
suitable probes for use in the diagnostic assays of the invention are described herein. 

An agent for detecting MOLX protein is an antibody capable of binding to MOLX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 

30 preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab) 2 ) can be 
used. The term "labeled", with regard to the probe or antibody, is intended to encompass direct 
labeling of the probe or antibody by coupling (i.e., physically linking) a detectable substance to 
the probe or antibody, as well as indirect labeling of the probe or antibody by reactivity with 
another reagent that is directly labeled. Examples of indirect labeling include detection of a 
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primary antibody using a fluorescent] y-labeled secondary antibody and end-labeling of a DNA 
probe with biotin such that it can be detected with fluorescently-labeled streptavidin. The term 
"biological sample" is intended to include tissues, cells and biological fluids isolated from a 
subject, as well as tissues, cells and fluids present within a subject That is, the detection method 
of the invention can be used to detect MOLX mRNA, protein, or genomic DNA in a biological 
sample in vitro as well as in vivo. For example, in vitro techniques for detection of MOLX 
mRNA include Northern hybridizations and in situ hybridizations. In vitro techniques for 
detection of MOLX protein include enzyme linked immunosorbent assays (ELISAs), Western 
blots, immunoprecipilations, and immunofluorescence. In vitro techniques for detection of 
MOLX genomic DNA include Southern hybridizations. Furthermore, in vivo techniques for 
detection of MOLX protein include introducing into a subject a labeled anti-MOLX antibody. 
For example, the antibody can be labeled with a radioactive marker whose presence and location 
in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 
subject. Alternatively, the biological sample can contain mRNA molecules from the test subject 
or genomic DNA molecules from the test subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent capable 
of detecting MOLX protein, mRNA, or genomic DNA, such that the presence of MOLX protein, 
mRNA or genomic DNA is detected in the biological sample, and comparing the presence of 
MOLX protein, mRNA or genomic DNA in the control sample with the presence, of MOLX 
protein, mRNA or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of MOLX in a biological 
sample. For example, the kit can comprise: a labeled compound or agent capable of detecting 
MOLX protein or mRNA in a biological sample; means for determining the amount of MOLX in 
the sample; and means for comparing the amount of MOLX in the sample with a standard. The 
compound or agent can be packaged in a suitable container. The kit can further comprise 
instructions for using the kit to detect MOLX protein or nucleic acid. 

Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify subjects 
having or at risk of developing a disease or disorder associated with aberrant MOLX expression 
or activity. For example, the assays described herein, such as the preceding diagnostic assays or 
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the following assays, can be utilized to identify a subject having or at risk of developing a 
disorder associated with MOLX protein, nucleic acid expression or activity. Alternatively, the 
prognostic assays can be utilized to identify a subject having or at risk for developing a disease 
or disorder. Thus, the invention provides a method for identifying a disease or disorder 

5 associated with aberrant MOLX expression or activity in which a test sample is obtained from a 
subject and MOLX protein or nucleic acid {e.g., mRNA, genomic DNA) is detected, wherein the 
presence of MOLX protein or nucleic acid is diagnostic for a subject having or at risk of 
developing a disease or disorder associated with aberrant MOLX expression or activity. As used 
herein, a "test sample" refers to a biological sample obtained from a subject of interest. For 

10 example, a test sample can be a biological fluid (e.g., serum), cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine whether a 
subject can be administered an agent (e.g.. an agonist, antagonist, peptidomimetic, protein, 
peptide, nucleic acid, small molecule, or other drug candidate) to treat a disease or disorder 
associated with aberrant MOLX expression or activity. For example, such methods can be used 

1 5 to determine whether a subject can be effectively treated with an agent for a disorder. Thus, the 
invention provides methods for determining whether a subject can be effectively treated with an 
agent for a disorder associated with aberrant MOLX expression or activity in which a test sample 
is obtained and MOLX protein or nucleic acid is detected {e.g. t wherein the presence of MOLX 
protein or nucleic acid is diagnostic for a subject that can be administered the agent to treat a 

20 disorder associated with aberrant MOLX expression or activity). 

The methods of the invention can also be used to detect genetic lesions in an MOLX 
gene, thereby determining if a subject with the lesioned gene is at risk for a disorder 
characterized by aberrant cell proliferation and/or differentiation. In various embodiments, the 
methods include detecting, in a sample of cells from the subject, the presence or absence of a 

25 genetic lesion characterized by at least one of an alteration affecting the integrity of a gene 
encoding an MOLX-protein, or the misexnjession of the MOLX gene. For example, such 
genetic lesions can be detected by ascertaining the existence of at least one of; (/) a deletion of 
one or more nucleotides from an MOLX gene; (it) an addition of one or more nucleotides to an 
MOLX gene; (Hi) a substitution of one or more nucleotides of an MOLX gene, (iv) a 

30 chromosomal rearrangement of an MOLX gene; (v) an alteration in the level of a messenger 
RNA transcript of an MOLX gene, (vi) aberrant modification of an MOLX gene, such as of the 
methylation pattern of the genomic DNA, (vn) the presence of a non-wild-type splicing partem 
of a messenger RNA transcript of an MOLX gene, (vm) a non-wild-type level of an MOLX 
protein, (ix) allelic loss of an MOLX gene, and (x) inappropriate post-translational modification 
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of an MOLX protein. As described herein, there are a large number of assay techniques known 
in the art which can be used for detecting lesions in an MOLX gene. A preferred biological 
sample is a peripheral blood leukocyte sample isolated by conventional means from a subject. 
However, any biological sample containing nucleated cells may be used, including, for example, 
5 buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer in a 
polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202), such as 
anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) (see, e.g., 
Landegran, et a/., 1988. Science 241: 1077-1080; and Nakazawa, et al, 1994. Proc. Natl Acad. 

10 Sci. USA 91 : 360-364), the latter of which can be particularly useful for detecting point 

mutations in the MOLX-gene (see, Abravaya, et al., 1 995. Nucl. Acids Res. 23: 675-682). This 
method can include the steps of collecting a sample of cells from a patient, isolating nucleic acid 
(<z-g-) genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid sample 
with one or more primers that specifically hybridize to an MOLX gene under conditions such 

1 5 that hybridization and amplification of the MOLX gene (if present) occurs, and detecting the 
presence or absence of an amplification product, or detecting the size of the amplification 
product and comparing the length to a control sample. It is anticipated that PCR and/or LCR 
may be desirable to use as a preliminary amplification step in conjunction with any of the 
techniques used for detecting mutations described herein. 

20 Alternative amplification methods include: self sustained sequence replication (see, 

Guatelli, et al, 1990. Proc. Natl. Acad. Set USA 87: 1874-1878), transcriptional amplification 
system (see, Kwoh, et a/., 1989. Proc. Natl Acad. ScL USA 86: 1 173-1 177); Qp Replicase (see, 
Lizardi, et dl $ 1 988. BioTechnology 6: 1 2 97), or any other nucleic acid amplification method, 
followed by the detection of the amplified molecules using techniques well known to those of 

25 skill in the art These detection schemes are especially useful for the detection of nucleic acid 
molecules if such molecules are present in very low numbers. 

In an alternative embodiment, mutations in an MOLX gene from a sample cell can be 
identified by alterations in restriction enzyme cleavage patterns. For example, sample and 
control DNA is isolated, amplified (optionally), digested with one or more restriction 

30 endonucleases, and fragment length sizes are determined by gel electrophoresis and compared. 
Differences in fragment length sizes between sample and control DNA indicates mutations in the 
sample DNA. Moreover, the use of sequence specific ribozymes (see, e.g. t U.S. Patent No. 
5,493,531) can be used to score for the presence of specific mutations by development or loss of 
a ribozyme cleavage site. 
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In other embodiments, genetic mutations in MOLX can be identified by hybridizing a 
sample and control nucleic acids, e.g., DNA or RNA. to high-density arrays containing hundreds 
or thousands of oligonucleotides probes. See, e.g., Cronin, et a/., 1996. Human Mutation 7: 
244-255; Kozal, et a/., 1996. Nat. Med. 2: 753-759. For example, genetic mutations in MOLX 
5 can be identified in two dimensional arrays containing light-generated DNA probes as described 
in Cronin, et al, supra. Briefly, a first hybridization array of probes can be used to scan through 
long stretches of DNA in a sample and control to identify base changes between the sequences 
by making linear arrays of sequential overlapping probes. This step allows the identification of 
point mutations. This is followed by a second hybridization array that allows the 

1 0 characterization of specific mutations by using smaller, specialized probe arrays complementary 
to all variants or mutations detected. Each mutation array is composed of parallel probe sets, one 
complementary to the wild-type gene and the other complementary to the mutant gene. 

In yet another embodiment, any of a variety' of sequencing reactions known in the art can 
be used to directly sequence the MOLX gene and detect mutations by comparing the sequence of 

1 5 the sample MOLX with the corresponding wild-type (control) sequence. Examples of 

sequencing reactions include those based on techniques developed by Maxim and Gilbert, 1977. 
Proc. Natl. Acad. Sci. USA 14: 560 or Sanger, 1977. Proc. Natl Acad. Set USA 74: 5463. It is 
also contemplated that any of a variety of automated sequencing procedures can be utilized when 
performing the diagnostic assays (see, e.g., Naeve, et al, 1995. Bioiechniques 19: 448), 

20 including sequencing by mass spectrometry (see, eg., PCT International Publication No. WO 
94/16101; Cohen, et al, 1996. Adv. Chromatography 36: 127-162; and Griffin, et ah, 1993. 
Appl Biochem. Biotechnol 38: 147-159). 

Other methods for detecting mutations in the MOLX gene include methods in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or RNA/DN A 

25 heteroduplexes. See, e.g. t Myers, et a/., 1985. Science 230: 1242. In general, the art technique 
of "mismatch cleavage" starts by providing heteroduplexes of formed by hybridizing (labeled) 
KNA or DNA containing the wild-type MOLX sequence with potentially mutant RNA or DNA 
obtained from a tissue sample. The double-stranded duplexes are treated with an agent that 
cleaves single-stranded regions of the duplex such as which will exist due to basepair 

30 mismatches between the control and sample strands. For instance, RNA/DNA duplexes can be 
treated with KNase and DNA/DNA hybrids treated with Sj nuclease to enzyrnatically digesting 
the mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes can 
be treated with hydroxylamine or osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched regions, the resulting material is then 
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separated by size on denaturing polyacrylamide gels to determine the site of mutation. See, e.g., 
Cotton, et al, 198S. Proc. Natl. Acad. Set USA 85: 4397; Saleeba, et al, 1992. Methods 
EnzymoL 217: 286-295. In an embodiment, the control DNA or RNA can be labeled for 
detection. 

5 In still another embodiment, the mismatch cleavage reaction employs one or more 

proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in 
MOLX cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli 
cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at 

10 G/T mismatches. See, e.g., Hsu, et al., 1994. Carcinogenesis 15: 1657-1662. According to an 
exemplary embodiment, a probe based on an MOLX sequence, e.g., a wild-type MOLX 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is 
treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be detected 
from electrophoresis protocols or the like. See, e.g. t U.S. Patent No. 5,459,039. 

15 In other embodiments, alterations in clcctrophorctic mobility will be used to identify 

mutations in MOLX genes. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences in electrophoretic mobility between mutant and wild type 
nucleic acids. See, e.g t Orita, et aL, 1989. Proc. Natl Acad: Sci. USA: 86: 2766; Cotton, 1993. 
Mutat. Res. 285: 125-144; Hayashi, 1992. Genet. Anal. Tech. Appl 9: 73-79. Single-stranded 

20 DNA fragments of sample and control MOLX nucleic acids will be denatured and allowed to 
renature. The secondary structure of single-stranded nucleic acids varies according to sequence, 
the resulting alteration in electrophoretic mobility enables the detection of even a single base 
change. The DNA fragments may be labeled or detected with labeled probes. The sensitivity of 
the assay may be enhanced by using RNA (rather than DNA), in which the secondary structure is 

25 more sensitive to a change in sequence* In one embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 
changes in electrophoretic mobility. See, e.g., Keen, etal, 1991. Trends Genet. 7: 5. 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient gel 

30 electrophoresis (DGGE). See, e.g., Myers, et al, 1985. Nature 313: 495. When DGGE is used 
as the method of analysis, DNA will be modified to insure that it does not completely denature, 
for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich DNA by 
PCR- In a further embodiment, a temperature gradient is used in place of a denaturing gradient 
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to identify differences in the mobility of control and sample DNA. See, e.g., Rosenbaum and 
Reissner, 1987. Biophys. Chem. 265: 12753. 

Examples of other techniques for detecting point mutations include, but are not limited 
to, selective oligonucleotide hybridization, selective amplification, or selective primer extension. 

5 For example, oligonucleotide primers may be prepared in which the known mutation is placed 
centrally and then hybridized to target DNA under conditions that permit hybridization only if a 
perfect match is found. See, e.g., Saiki, et al, 1986. Nature 324: 163; Saiki, et al. 9 1989. Proc. 
Natl Acad. ScL USA 86: 6230. Such allele specific oligonucleotides are hybridized to PCR 
amplified target DNA or a number of different mutations when the oligonucleotides are attached 

10 to the hybridizing membrane and hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology that depends on selective PCR 
amplification may be used in conjunction with the instant invention. Oligonucleotides used as 
primers for specific amplification may carry the mutation of interest in the center of the molecule 
(so that amplification depends on differential hybridization; see, e.g., Gibbs, et aL> 1989. Nucl. 

15 Acids Res. 17: 2437-2448) or at the extreme 3-terminus of one primer where, under appropriate 
conditions, mismatch can prevent, or reduce polymerase extension (see, e.g., Prossner, 1993. 
Tibteck. 1 1 : 238). In addition it may be desirable to introduce a novel restriction site in the 
region of the mutation to create cleavage-based detection. See, e.g., Gasparini, et ah, 1992. Mol 
Cell Probes 6: 1. It is anticipated that in certain embodiments amplification may also be 

20 performed using Taq ligase for amplification. See, e.g., Barany, 1991, Proc. Natl Acad. Sci. 

USA 88: 189. In such cases, ligation will occur only if there is a perfect match at the 3 -terminus 
of the 5' sequence, making it possible to detect the presence of a known mutation at a specific 
site by looking for the presence or absence of amplification; . 

The methods described herein may be performed, for example, by utilizing pre-packaged 

25 diagnostic kits comprising at least one probe nucleic acid or antibody reagent described herein, 
which may be conveniently used, e.g., in clinical settings to diagnose patients exhibiting 
symptoms or family history of a disease or illness involving an MOLX gene. 

Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 
MOLX is expressed may be utilized in the prognostic assays described herein. However, any 

30 biological sample containing nucleated cells may be used, including, for example, buccal 
mucosal cells. 
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Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on MOLX activity 
(e.g., MOLX gene expression), as identified by a screening assay described herein can be 
5 administered to individuals to treat (prophylactically or therapeutically) disorders (The disorders 
include metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated 
cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, 
immune disorders, and hematopoietic disorders, and the various dyslipidemias, metabolic 
disturbances associated with obesity, the metabolic syndrome X and wasting disorders 

10 associated with chronic diseases and various cancers.) In conjunction with such treatment, the 
pharmacogenomics (i.e., the study of the relationship between an individual's genotype and that 
individual's response to a foreign compound or drug) of the individual may be considered. 
Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic failure by 
altering the relation between dose and blood concentration of the pharmacologically active drug. 

15 Thus, the pharmacogenomics of the individual permits the selection of effective agents (e.g., 
drugs) for prophylactic or therapeutic treatments based on a consideration of the individual's 
genotype. Such pharmacogenomics can further be used to determine appropriate dosages and 
therapeutic regimens. Accordingly, the activity of MOLX protein, expression of MOLX nucleic 
acid, or mutation content of MOLX genes in an individual can be determined to thereby select 

20 appropriate agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the response 
to drugs due to altered drug disposition and.abnormal action in affected persons. See e.g. , 
Eichelbaum, 1996. Clin. Exp: Pharmacol Physiol, 23; 983-985; Linder, 1997. Clin. Chem: 9 43: 
254-266. In general, two types of pharmacogcnctic conditions can be differentiated. Genetic 

25 conditions transmitted as a single factor altering the way drugs act on the body (altered drug 

action) or genetic conditions transmitted as single factors altering the way the body acts on drugs 
(altered drug metabolism). These pharmacogenetic conditions can occur either as rare defects or 
as polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency is a 
common inherited enzymopathy in which the main clinical complication is hemolysis after 

30 ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and 
consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a major 
determinant of both the intensity and duration of drug action. Hie discovery of genetic 
polymorphisms of drug metabolizing enzymes (e.g. , N-acetyltransferasc 2 (NAT 2) and 
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cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation as to why 
some patients do not obtain the expected drug effects or show exaggerated drug response and 
serious toxicity after taking the standard and safe dose of a drug. These polymorphisms are 
expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor 
5 metabolizer (PM). The prevalence of PM is different among different populations. For example, 
the gene coding for CYP2D6 is highly polymorphic and several mutations have been identified 
in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and 
CYP2C19 quite frequently experience exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active therapeutic moiety, PM show no therapeutic 

10 response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6-formed 
metabolite morphine. At the other extreme are the so called ultra-rapid metabolizers who do not 
respond to standard doses. Recently, the molecular basis of ultra-rapid metabolism has been 
identified to be due to CYP2D6 gene amplification. 

Thus, the activity of MOLX protein, expression of MOLX nucleic acid, or mutation 

15 content of MOLX genes in an individual can be determined to thereby select appropriate agent(s) 
for therapeutic or prophylactic treatment of the individual. In addition, pharmacogenetic studies 
can be used to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to 
the identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 

20 enhance therapeutic or prophylactic efficiency whcn ; treating a subject with an MOLX 

modulator, such as a modulator identified by one of the exemplary screening assays described 
herein. 

Monitoring of Effects During Clinical Trials 

25 Monitoring the influence of agents (e.g. , drugs, compounds) on the expression or activity 

of MOLX (e.g. , the ability to modulate aberrant cell proliferation and/or differentiation) can be 
applied not only in basic drug screening, but also in clinical trials. For example, the 
effectiveness of an agent determined by a screening assay as described herein to increase MOLX 
gene expression, protein levels, or upregulate MOLX activity, can be monitored in clinical trails 

30 of subjects exhibiting decreased MOLX gene expression, protein levels, or downregulated 

MOLX activity. Alternatively, the effectiveness of an agent determined by a screening assay to 
decrease MOLX gene expression, protein levels, or downregulate MOLX activity, can be 
monitored in clinical trails of subjects exhibiting increased MOLX gene expression, protein 
levels, or upregulated MOLX activity. In such clinical trials, the expression or activity of 
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MOLX and, preferably, other genes that have been implicated in, for example, a cellular 
proliferation or immune disorder can be used as a "read out" or markers of the immune 
responsiveness of a particular cell. 

By way of example, and not of limitation, genes, including MOLX, that are modulated in 
cells by treatment with an agent (e.g., compound, drug or small molecule) that modulates MOLX 
activity (e.g. 9 identified in a screening assay as described herein) can be identified. Thus, to 
study the effect of agents on cellular proliferation disorders, for example, in a clinical trial, cells 
can be isolated and RNA prepared and analyzed for the levels of expression of MOLX and other 
genes implicated in the disorder. The levels of gene expression (i.e., a gene expression pattern) 
can be quantified by Northern blot analysis or RT-PCR, as described herein, or alternatively by 
measuring the amount of protein produced, by one of the methods as described herein, or by 
measuring the levels of activity of MOLX or other genes. In this manner, the gene expression 
pattern can serve as a marker, indicative of the physiological response of the cells to the agent. 
Accordingly, this response state may be determined before, and at various points during, 
treatment of the individual with the agent. 

In one embodiment, the invention provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, antagonist, protein, peptide, 
peptidomimetic, nucleic acid, small molecule, or other drug candidate identified by the screening 
assays described herein) comprising the steps of (/) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (if) detecting the level of expression of an MOLX 
protein, mRNA, or genomic DNA in the preadministration sample; (Hi) obtaining one or more 
post-administration samples from the subject; (zv) detecting the level of expression or activity of 
the MOLX protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing 
the level of expression or activity of the MOLX protein, mRNA, or genomic DNA in the 
pre-administration sample with the MOLX protein, mRNA, or genomic DNA in the post 
administration sample or samples; and (v/) altering the administration of the agent to the subject 
accordingly. For example, increased administration of the agent may be desirable to increase the 
expression or activity of MOLX to higher levels than detected, i.e., to increase the effectiveness 
of the agent. Alternatively, decreased administration of the agent may be desirable to decrease 
expression or activity of MOLX to lower levels than detected, i.e., to decrease the effectiveness 
of the agent. 
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Methods of Treatment 

The invention provides for both prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant 
MOLX expression or activity. The disorders include cardiomyopathy, atherosclerosis, 

5 hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 

atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, obesity, 
transplantation, adrenoleukodystrophy, congenital adrenal hyperplasia, prostate cancer, 
neoplasm; adenocarcinoma, lymphoma, uterus cancer, fertility, hemophilia, hypercoagulation, 

1 0 idiopathic thrombocytopenic purpura, immunodeficiencies, graft versus host disease, AIDS, 
bronchial asthma, Crohn's disease; multiple sclerosis, treatment of Albright Hereditary 
Osteodystrophy, and other diseases, disorders and conditions of the like. 
These methods of treatment will be discussed more fully, below. 

1 5 Disease and Disorders 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering from the disease or disorder) levels ot biological activity may be treated with 
Therapeutics that antagonize (i.e. , reduce or inhibit) activity. Therapeutics that antagonize 
. activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may be 

20 utilized include, but are not limited to: (/) an aforementioned peptide, or analogs, derivatives, 
fragments or homologs thereof; (ii) antibodies to an aforementioned peptide; (iif) nucleic acids 
encoding an aforementioned peptide; (rv) administration of ann'sense nucleic acid and nucleic 
acids that are "dysfunctional" (f.e\, due to a heterologous insertion within the coding sequences 
of coding sequences to an aforementioned peptide) that are utilized to "knockout" endoggcnous 

25 function of an aforementioned peptide by homologous recombination (see, e.g. t Capecchi, 1989. 
Science 244: 1288-1292); or (v) modulators ( i.e., inhibitors, agonists and antagonists, including 
additional peptide mimetic of the invention or antibodies specific to a peptide of the invention) 
that alter the interaction between an aforementioned peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject not 

30 suffering from the disease or disorder) levels or.biological activity may be treated with 

Therapeutics that increase (i.e. 9 are agonists to) activity. Therapeutics that upregulate activity 
may be administered in a therapeutic or prophylactic manner. Therapeutics that may be utilized 
include, but are not limited to, an aforementioned peptide, or analogs, derivatives, fragments or 
homologs thereof; or an agonist that increases bioavailability. 
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Increased or decreased levels can be readily detected by quantifying peptide and/or RNA, 
by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in vitro for RNA or 
peptide levels, structure and/or activity of the expressed peptides (or mRNAs of an 
aforementioned peptide). Methods that are well-known within the art include, but are not limited 
5 to, immunoassays (e.g., by Western blot analysis, immunoprecipitation followed by sodium 
dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, irnmunocytochemistry, etc.) and/or 
hybridization assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in situ 
hybridization, and the like). 

10 Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a disease or 
condition associated with an aberrant MOLX expression or activity, by administering to the 
subject an agent that modulates MOLX expression or at least one MOLX activity. Subjects at 
risk for a disease that is caused or contributed to by aberrant MOLX expression or activity can be 

15 identified by, for example, any or a combination of diagnostic or prognostic assays as described 
herein. Administration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the MOLX aberrancy, such that a disease or disorder is prevented or, 
alternatively, delayed in its progression. Depending upon the type of MOLX aberrancy, for 
. example, an MOLX agonist or MOLX antagonist agent can be used for treating the subject The 

20 appropriate agent can be determined based on screening assays described herein. The 
prophylactic methods of the invention are further discussed in the following subsections. 

Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating MOLX expression or 

25 activity for therapeutic purposes. The modulatory method of the invention involves contacting a 
cell with an agent that modulates one or more of the activities of MOLX protein activity 
associated with the cell. An agent that modulates MOLX protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a naturally-occurring cognate ligand of an 
MOLX protein, a peptide, an MOLX peptidomimetic, or other small molecule. In one 

30 embodiment, the agent stimulates one or more MOLX protein activity. Examples of such 

stimulatory agents include active MOLX protein and a nucleic acid molecule encoding MOLX 
that has been introduced into the cell. In another embodiment, the agent inhibits one or more 
MOLX protein activity. Examples of such inhibitory agents include antisense MOLX nucleic 
acid molecules and anti-MOLX antibodies. These modulatory methods can be performed in 
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vitro {e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering 
the agent to a subject). As such, the invention provides methods of treating an individual 
afflicted with a disease or disorder characterized by aberrant expression or activity of an MOLX 
protein or nucleic acid molecule. In one embodiment, the method involves administering an 
5 agent (e.g., an agent identified by a screening assay described herein), or combination of agents 
that modulates (e.g. y up-regulates or down-regulates) MOLX expression or activity. In another 
embodiment, the method involves administering an MOLX protein or nucleic acid molecule as 
therapy to compensate for reduced or aberrant MOLX expression or activity. 

Stimulation of MOLX activity is desirable in situations in which MOLX is abnormally 
10 downregulated and/or in which increased MOLX activity is likely to have a beneficial effect. 
One example of such a situation is where a subject has a disorder characterized by aberrant cell 
proliferation and/or differentiation (e.g., cancer or immune associated disorders). Another 
example of such a situation is where the subject has a gestational disease (e.g., preclampsia). 

Determination of the Biological Effect of the Therapeutic 

15 In various embodiments of the invention, suitable in vitro or in vivo assays are performed 

to determine the effect of a specific Therapeutic and whether its administration is indicated for 
treatment of the affected tissue. 

In various specific embodiments, in vitro assays may be performed with representative 
cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic exerts 
20 the desired effect upon the cell type(s). Compounds for use in therapy may be tested in suitable 
animal model systems including, but not limited to rats, mice, chicken, cows, monkeys, rabbits, 
. and the like, prior to testing in human subjects. Similarly, for in vivo testing, any of the animal 
model system known in the art may be used prior to administration to human subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

25 The MOLX nucleic acids and proteins of the invention are useful in potential 

prophylactic and therapeutic applications implicated in a variety of disorders including, but not 
limited to: metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated 
cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune 
disorders, hematopoietic disorders, and the various dyslipidemias, metabolic disturbances 

30 associated with obesity, the metabolic syndrome X and wasting disorders associated with 

chronic diseases and various cancers. 

As an example, a cDNA encoding the MOLX protein of the invention may be useful in 

gene therapy, and the protein may be useful when administered to a subject in need thereof. By 
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way of non-limiting example, the compositions of the invention will have efficacy for treatment 
of patients suffering from: metabolic disorders, diabetes, obesity, infectious disease, anorexia, 
cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, 
Parkinson's Disorder, immune disorders, hematopoietic disorders, and the various dyslipidemias. 
5 Both the novel nucleic acid encoding the MOLX protein, and the MOLX protein of the 

invention, or fragments thereof, may also be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. A further use could be 
as an anti-bacterial molecule (i.e., some peptides have been found to possess anti-bacterial 
properties). These materials are further useful in the generation of antibodies which 
10 immunospecifically-bind to the novel substances of the invention for use in therapeutic or 
diagnostic methods. 

Examples 

Example 1: Quantitative expression analysis of clones in various cells and tissues 

The quantitative expression of various clones was assessed using microtiter plates 

1 5 containing RNA samples from a variety of normal and pathology-derived cells, cell lines and 
tissues using real time quantitative PCR (RTQ PCR; TAQMAN*). RTQ PCR was performed 
on a Perlrin-Elmer Biosystems ABI PRISM® 7700 Sequence Detection System. Various 
collections of samples are assembled on the plates, and referred to as Panel 1 (containing cells 
and cell lines from normal and cancer sources), Panel 2 (containing samples derived from 

20 tissues, in particular from surgical samples, from normal and cancer sources), Panel 3 

(containing samples derived from a wide variety of cancer sources) and Panel 4 (containing cells 
and cell lines from normal cells and cells related to inflammatory conditions). 

First, the RNA samples were normalized to constitutively expressed genes such as P - ■ 
actin and GAPDFT. RNA (-50 ng total or ~ I ng poly A+) was converted to cDNA using the 

25 TAQMAN® Reverse Transcription Reagents Kit (PE Biosystems, Foster City, CA; Catalog No. 
N808-0234) and random hexamers according to the manufacturer's protocol. Reactions were 
performed in 20 ul and incubated for 30 min. at 48°C. cDNA (5 ul) was then transferred to a 
separate plate for the TAQMAN® reaction using p-actin and GAPDR TAQMAN® Assay 
Reagents (PE Biosystems; Catalog Nos. 4310881E and 4310884E, respectively) and 

30 TAQMAN® universal PCR Master Mix (PE Biosystems; Catalog No. 4304447) according to the 
manufacturer's protocol. Reactions were performed in 25 ul using the following parameters: 2 
min. at 50°C; 10 min. at 95°C; 15 sec. at 95°C/1 min. at 60°C (40 cycles). Results were recorded 
as CT values (cycle at which a given sample crosses a threshold level of fluorescence) using a 
log scale, with the difference in RNA concentration between a given sample and the sample with 
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the lowest CT value being represented as 2 to the power of delta CT. The percent relative 
expression is then obtained by taking the reciprocal of this RNA difference and multiplying by 
100. The average CT values obtained for B-actin and GAPDH were used to normalize RNA 
samples. The RNA sample generating the highest CT value required no further diluting, while 
5 all other samples were diluted relative to this sample according to their (5 -actin /GAPDH 
average CT values. 

Normalized RNA (5 ul) was converted to cDNA and analyzed via TAQMAN® using 
One Step RT-PCR Master Mix Reagents (PE Biosystems; Catalog No. 4309169) and gene- 
specific primers according to the manufacturer's instructions. Probes and primers were designed 

1 0 for each assay according to Perkin Elmer Biosystem's Primer Express Software package 

(version I for Apple Computer's Macintosh Power PC) or a similar algorithm using the target 
sequence as input. Default settings were used for reaction conditions and the following 
parameters were set before selecting primers: primer concentration = 250 nM, primer melting 
temperature (T m ) range = 58°-60° C, primer optimal Tm = 59° C, maximum primer difference = 

15 2° C, probe does not have 5' G, probe T m must be 10° C greater than primer T m , amplicon size 75 
bp to 100 bp. The probes and primers selected (see below) were synthesized by Synthegen 
(Houston, TX, USA). Probes were double purified by HPLC to remove uncoupled dye and 
evaluated by mass spectroscopy to verify coupling of reporter and quencher dyes to the 5' and 3' 
ends of the probe, respectively. Their final concentrations were: forward and reverse primers, 

20 900 nM each, and probe, 200nM. 

PCR conditions: Normalized RNA from each tissue and each cell line was spotted in 
each well of a 96 well PCR plate (Perkin Elmer Biosystems). PCR cocktails including two 
probes (a probe specific for the target clone and another gene-specific probe multiplexed with the 
target probe) were set up using IX TaqMan™ PCR Master Mix for the PE Biosystems 7700, 

25 with 5 mM MgC12, dNTPs (dA, G, C, U at 1 : 1 : 1 :2 ratios), 025 U/ml AmpliTaq Gold™ (PE 
Biosystems), and 0.4 U/^il RNase inhibitor, and 0.25 U/^l reverse transcriptase. Reverse 
transcription was performed at 48° C for 30 minutes followed by amplification/PCR cycles as 
follows: 95° C 10 min, then 40 cycles of 95° C for 15 seconds, 60° C for 1 minute. 

30 

In the results for Panel 1, the following abbreviations are used: 
ca. = carcinoma, 
- * - established from metastasis, 
met = metastasis, 
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s cell var= small cell variant, 

non-s = non-sm =non-srnall, 

squam = squamous, 

pi. eff = pi effusion = pleural effusion, 

glio = glioma, 

astro = astrocytoma, and 

neuro = neuroblastoma. 

Panel 2 

The plates for Panel 2 generally include 2 control wells and 94 test samples composed of 
RNA or cDNA isolated from human tissue procured by surgeons working in close cooperation 
with the National Cancer Institute's Cooperative Human Tissue Network (CHTN) or the 
National Disease Research Initiative (NDRI). The tissues are derived from human malignancies 
and in cases where indicated many malignant tissues have "matched margins" obtained from 
noncancerous tissue just adjacent to the tumor. These are termed normal adjacent tissues and are 
denoted "NAT" in the results below! The tumor tissue and the "matched margins" are evaluated 
by two independent pathologists (the surgical pathologists and again by a pathologists at NDRI 
or CHTN). This analysis provides a gross histopathological assessment of tumor differentiation 
grade. Moreover, most samples include the original surgical pathology report that provides 
information regarding the clinical stage of the patient. These matched margins are taken from the 
tissue surrounding (i.e. immediately proximal) to the zone of surgery (designated "NAT", for 
normal adjacent tissue, in Table RR). In addition, RNA and cDNA samples were obtained from 
various human tissues derived from autopsies performed on elderly people or sudden death 
victims (accidents, etc.). These tissue were ascertained to be free of disease and were purchased 
from various commercial sources such as Clontech (Palo Alto, CA), Research Genetics, and 
Invitrogcn. 

RNA integrity from all samples is controlled for quality by visual assessment of agarose 
gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio as a guide (2:1 
to 2.5: 1 28s: 1 8s) and the absence of low molecular weight RNAs that would be indicative of 
degradation products. Samples are controlled against genomic DNA contamination by RTQ 
PCR reactions run in the absence of reverse transcriptase using probe and primer sets designed to 
amplify across the span of a single exon. 
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Panel 4 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 lest samples) composed 
of RNA (Panel 4r) or cDNA (Panel 4d) isolated from various human cell lines or tissues related 
5 to inflammatory conditions. Total RNA from control normal tissues such as colon and lung 

(Stratagene ,La Jolla, CA) and thymus and kidney (Clontech) were employed. Total RNA from 
liver tissue from cirrhosis patients and kidney from lupus patients was obtained from BioChain 
(Biochain Institute, Inc., Hayward, CA). Intestinal tissue for RNA preparation from patients 
diagnosed as having Crohn's disease and ulcerative colitis was obtained from the National 
10 Disease Research Interchange (NDR1) (Philadelphia, PA). 

Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle cells, 
small airway epithelium, bronchial epithelium, microvascular dermal endothelial cells, 
microvascular lung endothelial cells, human pulmonary aortic endothelial cells, human umbilical 
vein endothelial cells were all purchased from Clonetics (Walkersville, MD) and grown in the 

15 media supplied for these cell types by Clonetics. These primary cell types were activated with 
various cytokines or combinations of cytokines for 6 and/or 12-14 hours, as indicated. The 
following cytokines were used; 1L-1 beta at approximately 1-5 ng/ml, INF alpha at 
approximately 5-10 ng/ml, IFN gamma at approximately 20-50 ng/ml, IL-4 at approximately 5- 
10 ng/ml, IL-9 at approximately 5-10 ng/ml, IL-13 at approximately 5-10 ng/ml. Endothelial 

20 cells were sometimes starved for various times by culture in the basal media from Clonetics with 
0.1%serum. 

Mononuclear cells were prepared from blood of employees at CuraGen Corporation, 
using Ficoll. LAK. cells were prepared from these cells by culture in DMEM 5% FCS 
•(Hyclone), 100 uM non essential amino acids (Gibco/Life Technologies, Rockville, MD), 1 mM 

25 sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10" 5 M (Gibco), and 10 mM Hepes (Gibco) and 
Interleukin 2 for 4-6 days. Cells were then either activated with 10-20 ng/ml PMA and 1-2 
fig/ml ionomycin, EL- 1 2 at 5-10 ng/ml, IFN gamma at 20-50 ng/ml and IL-18 at 5-10 ng/ml for 6 
hours. In some cases, mononuclear cells were cultured for 4-5 days in DMEM 5% FCS 
(Hyclone), 100 uM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 

30 mercaptoethanol 5.5 x 10* 5 M (Gibco), and 10 mM Hepcs (Gibco) with PHA 

(phytohemagglutinin) or PWM (pokeweed mitogen) at approximately 5 ng/ml. Samples were 
taken at 24, 48 and 72 hours for RNA preparation. MLR (mixed lymphocyte reaction) samples 
were obtained by taking blood from two donors, isolating the mononuclear cells using Ficoll and 
mixing the isolated mononuclear cells 1:1 at a final concentration of approximately 2xl0 6 
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cells/ml in DMEM 5% FCS (Hyclone), 100 jiM non essential amino acids (Gibco), 1 niM 
sodium pyruvate (Gibco), mercaptoethanol (5.5 x 10" 5 M) (Gibco), and 10 mM Hepcs (Gibco). 
The MLR was cultured and samples taken at various time points ranging from 1- 7 days for RNA 
preparation. 

Monocytes were isolated from mononuclear cells using CD1 4 Miltenyi Reads, +ve VS 
selection columns and a Vario Magnet according to the manufacturer's instructions. Monocytes 
were differentiated into dendritic cells by culture in DMEM 5% fetal calf serum (FCS) 
(Hyclone, Logan, UT), 100 |oM non essential amino acids (Gibco), 1 mM sodium pyruvate 
(Gibco), mercaptoethanol 5.5 x 10" 5 M (Gibco), and 10 mM Hepes (Gibco), 50 ng/ml GMCSF 
and 5 ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of monocytes for 5-7 days 
in DMEM 5% FCS (Hyclone), 100 jaM non essential amino acids (Gibco), 1 mM sodium 
pyruvate (Gibco), mercaptoethanol 5.5 x 10' 5 M (Gibco), 10 mM Hepes (Gibco) and 10% AB 
Human Serum or MCSF at approximately 50 ng/ml. Monocytes, macrophages and dendritic cells 
were stimulated for 6 and 12-14 hours with lipopolysaccharide (LPS) at 100 ng/ml. Dendritic 
cells were also stimulated with anti-CD40 monoclonal antibody (Pharmingen) at 10 fag/ml for 6 
and 12-14 hours. 

CD4 lymphocytes, CDS lymphocytes and NK cells were also isolated from mononuclear 
cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection columns and a Vario 
Magnet according to the manufacturer's instructions. CD45RA and CD45RO CD4 lymphocytes 
were isolated by depleting mononuclear cells of CDS, CD56, CD14 and CD1 9 cells using CD8, 
CD56, CD14 and CD19 Miltenyi beads and +ve selection. Then CD45RO beads were used to 
isolate the CD45RO CD4 lymphocytes with the remaining cells being CD45RA CD4 
lymphocytes. CD45RA CD4, CD45RO CD4 and CD8 lymphocytes. were placed in DMEM 5% 
FCS (Hyclone), 100 pM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 
mercaptoethanol 5.5 x l& 5 M (Gibco), and 10 mM Hepes (Gibco) and plated at 10 6 cells/ml onto 
Falcon 6 well tissue culture plates that had been coated overnight with 0.5 fig/ml anti-CD28 
(Pharmingen) and 3 ug/ml anti-CD3 (OKT3, ATCC) in PBS. After 6 and 24 hours, the cells 
were harvested for RNA preparation. To prepare chronically activated CD8 lymphocytes, we 
activated the isolated CDS lymphocytes for 4 days on anti-CD28 and anti-CD3 coated plates and 
then harvested the cells and expanded them in DMEM 5% FCS (Hyclone), 100 jiM non essential 
amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10* 5 M (Gibco), 
and 10 mM Hepes (Gibco) and IL-2. The expanded CDS cells were then activated again with 
plate bound anti-CD3 and anti-CD28 for 4 days and expanded as before. RNA was isolated 6 
and 24 hours after the second activation and after 4 days of the second expansion culture. The 
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isolated NK cells were cultured in DMEM 5% FCS (Hyclone), 100 ^iM non essential amino 
acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10 s M (Gibco), and 10 
mM Hcpes (Gibco) and IL-2 for 4-6 days before RNA was prepared. 

To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with sterile 
5 dissecting scissors and then passed through a sieve. Tonsil cells were then spun down and 
resupended at 10 6 cells/ml in DMEM 5% FCS (Hyclone), 100 jiM non essential amino acids 
(Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10* s M (Gibco), and 10 mM 
Hepes (Gibco). To activate the cells, we used P WM at 5 \xgfm\ or anti-CD40 (Pharmingen) at 
approximately 10 fig/ml and IL-4 at 5-10 ng/ml. Cells were harvested for RNA preparation at 
10 24,48 and 72 hours. 

To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon plates were 
coated overnight with 10 fig/ml anti-CD28 (Pharmingen) and 2 ng/ml OKT3 (ATCC), and then 
washed twice with PBS. Umbilical cord blood CD4 lymphocytes (Poietic Systems, German 

5 6 

Town, MD) were cultured at 10 -10 cells/ml in DMEM 5% FCS (Hyclone), 100 pM non 
15 essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 x 10" 5 M 
(Gibco), 10 mM Hepes (Gibco) and IL-2 (4 ng/ml). IL-12 (5 ng/ml) and anti-114 (1 Dg/ml) 
were used to direct to Thl, while IL-4 (5 ng/ml) and anti-EFN gamma (1 Dg/ml) were used to 
direct to Th2 and IL-10 at 5 ng/ml was used to direct to Trl. After 4-5 days, the activated Thl, 
Th2 and Trl lymphocytes were washed once in DMEM and expanded for 4-7 days in DMEM 
20 5% FCS (Hyclone), 100 pM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), 
mercaptoethanol 5.5 x 10" 5 M (Gibco), 10 mM Hepes (Gibco) and IL-2 (1 ng/ml). Following . 
this, the activated Thl , Th2 and Trl lymphocytes were re-stimulated for 5 days with anti- 
CD28/OKT3 and cytokines as described above, but with (he addition of anti-CD95L (1 Dg/ml) 
to prevent apoptosis. After 4-5 days, the Thl, Th2 and Trl lymphocytes were washed and then 
25 expanded again with TL-2 for 4-7 days. Activated Thl and Th2 lymphocytes were maintained in 
this way for a maximum of three cycles. RNA was prepared from primary and secondary Thl, 
Th2 and Trl after 6 and 24 hours following the second and third activations with plate bound 
anti-CD3 and anti-CD28 mAbs and 4 days into the second and third expansion cultures in 
Interleukin 2. 

30 The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-1 , KU- 

8 12. EOL cells were further differentiated by culture in 0. 1 mM dbcAMP at 5 xlO 5 cells/ml for 

8 days, changing the media every 3 days and adjusting the cell. concentration to 5 xlO 5 cells/ml. 

For the culture of these cells, we used DMEM or RPM1 (as recommended by the ATCC), with 

the addition of 5% FCS (Hyclone), 100 jiM non essential amino acids (Gibco), 1 mM sodium 

147 



BNSDOCID: <WO 01B157BA2J_> . 



WO 01/81578 



PCT/USOI/13578 



pyruvate (Gibco), mercaptoethanol 5.5 x 10' 5 M (Gibco), 10 mM Hepes (Gibco). RNA was 
either prepared from resting cells or cells activated with PMA at 10 ng/ml and ionomycin at 1 
ug/ml for 6 and 14 hours. Keratinocyte line CCD106 and an airway epithelial tumor line NCI- 
H292 were also obtained from the ATCC. Both were cultured in DMEM 5% FCS (Hyclone), 
100 |xM non essential amino acids (Gibco), 1 mM sodium pyruvate (Gibco), mercaptoethanol 5.5 
x 10' 5 M (Gibco), and 10 mM Hepes (Gibco). CCD1 106 cells were activated for 6 and 14 hours 
with approximately 5 ng/ml TOF alpha and 1 ng/ml IL-1 beta, while NCI-H292 cells were 
activated for 6 and 14 hours with the following cytokines: 5 ng/ml IL-4, 5 ng/ml IL-9, 5 ng/ml 
IL-1 3 and 25 ng/ml IFN gamma. 

For these cell lines and blood cells, RNA was prepared by lysing approximately 10 7 
cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane (Molecular 
Research Corporation) was added to the RNA sample, vortexed and after 10 minutes at room 
temperature, the lubes were spun at 14,000 rpm in a Sorvall SS34 rotor. The aqueous phase was 
removed and placed in a 1 5 ml Falcon Tube. An equal volume of isopropanol was added and left 
at -20 degrees C overnight. The precipitated RNA was spun down at 9,000 rpm for 15 mrn in a 
Sorvall SS34 rotor and washed in 70% ethanol. The pellet was redissolved in 300 ^1 of RNAse- 
free water and 35 pi buffer (Promega) 5 pi DTT, 7 fxl RNAsin and 8 pi DNAse were added. The 
tube was incubated at 37 degrees C for 30 minutes to remove contaminating genomic DNA, 
extracted once with phenol cWoroform and re-precipitated with 1/10 volume of 3 M sodium 
acetate and 2 volumes of 100% ethanol. The RNA was spun down and placed in RNAse free 
water. RNA was stored at -80 degrees C. 

MOL1 

Expression of gene GM_79960 1 78 was assessed using the primer-probe set Agl 605, 
described in Table 12. Results of the RTQ-PCR runs are shown in Tables 1 3 and 14. 



Table 12. Probe Name: Agl 605 



Primers 


Sequences 


Tm 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5'-CCTGAGCTACAACAACATCATG-3' 


58.3 


22 


333 


76 


Probe 


FAM-5'- 

CCTCATATCCCTGTCCCTCAGCCATA- 
3'-TAMRA 


69 


26 


378 


77 


Reverse 


5'-GCAGAGTCTAGCATCAGGATGT-3' 


58.6 


22 


407 


78 
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Table 13. Panels 1.3D and 2D 



PANEL 1.3D 




PANEL 2D 






Tissue Name 


Rel. Expr., 

% 

1.3dx4tm54 
02f agl605 
bl 


Tissue Name 


Rel. Expr., 

% 

2Dtm2728f 
agl605 


Rel. Expr., 
% 

2dx4tm4732 
f_ag!605_a 
2 


Liver 

adenocarcinoma 


1.9 


Normal Colon GENPAK 
061003 


70.2 


70.3 


Pancreas 


o 


83219 CC Well to Mod Diff 
(OD03866) 


8.5 


6.3 


Pancreatic ca. 
CAPAN2 


17.6 


83220 CC NAT (OD03866) 


8.1 


9.4 


Adrenal gland 


1.3 


83221 CC Gr.2 rectosigmoid 
(OD03868) 


4.5 


12.1 


Thyroid 


5.6 


83222 CCNAT(OD03868) 


1.6 


6.3 


Salivary gland 


8.9 


83235 CC Mod Diff 
(ODO3920) 


17.9 


19.7 


Pituitary gland 


1.7 


83236 CC NAT (ODO3920) 


21.5 


11.6 




21.2 


83237 CC Gr.2 ascend colon 
(OD03921) 


18 


34.4 


Brain (whole) 


30.5 


83238 CC NAT (OD03921) 


8.9 


9.8 


Brain 

(amygdala) 


6.6 


8324 1 CC from Partial : 
Hepatectorriy (ODO4309) 


7.1 


4.9 


Brain 

(cerebellum) 


14.6 


83242 Liver NAT (ODO4309) 


17.1 


6.9 


Brain 

(hippocampus) 


8.7 


87472 Colon mets to lung 
(OD04451-01) 


6.7 


11.4 


Brain 

(substantia 

nigra) 


9.3 


87473 Lung NAT (OD04451- 
02) 


0.8 


2.9 


Brain 
(thalamus) 


4.8 


Normal Prostate Clontech A+ 
6546-1 


17.8 


52.9 


Cerebral Cortex 


2.6 


84140 Prostate Cancer 
(OD04410) 


18.6 


23 


Spinal cord 


15.7 


84141 Prostate NAT 
(OD04410) 


9.1 


22 


CNS ca. 

(glio/astro) 

U87-MG 


0 


87073 Prostate Cancer 
(OD04720-01) 


20 


14.5 


CNS ca. 
(glio/astro) Ti- 
ll 8-MG . 


1.5 


87074 Prostate NAT 
(OD04720-02) 


21.3 


38.1 


CNS ca. (astro) 
SW1783 


0. 


Normal Lung GENPAK 
061010 


69.7 


85.9 
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CNS ca.* 
(neuro; met ) 
SK-N-AS 


1.6 


83239 Lung Met to Muscle 
(OD04286) 


11.1 


6.3 


CNS ca. (astro) 
SF-539 


0 


83240 Muscle NAT 
(OD04286) 


15.1 


23.3 


CNS ca. (astro) 
SNB-75 


0 


84 1 36 Lung Malignant Cancer 
(OD03126) 


30.6 


30.8 


CNS ca. (glio) 
SNB-19 


9.7 


84137 Lung NAT (OD03126) 


8 


16.3 


CNS ca. (glio) 
U251 


4.8 


84871 Lung Cancer 
fOD04404) 


41.8 


45.2 


CNS ca. (glio) 
SF-295 


7.7 


84872 Lung NAT (OD04404) 


15.4 


27.5 


Heart (fetal) 


0.9 


84875 Lung Cancer 
(OD04565) 


7.4 


8.5 


Heart 


6.1 


84876 Lung NAT (OD04565) 


5.3 


1.2 


Fetal Skeletal 


3.5 


85950 Lung Cancer 
COD04237-0n 


77.9 


72.9 


Skeletal muscle 


100 


85970 Lung NAT (OD04237- 
02) 


14.3 


12.9 


Bone marrow 


22.8 


83255 Ocular Mel Met to 
Liver (ODO4310) 


2.7 


6 


Thymus 


12.2 


83256 Liver NAT (ODO4310) 


2.3 


5.1 


Spleen 


14.6 


84139 Melanoma Metsto 
Lune(OD04321 > ! 


1.3 


6.2 


Lymph node 


57.9 


84138 Lung NAT (OD04321) 


36.9 


22.4 


Colorectal 


3.4 


formal Kidney GENPAK 
061008 


9.2 


19.2 


Stomach 


20.7 


83786 Kidney Ca, Nuclear 
grade2(OD04338) 


4.2 


6.9 


Small intestine 


33.5 


83787 Kidney NAT 
(OD04338) 


7 


8-3- 


Colon ca. 
SW480 


0.6 


83788 Kidney Ca Nuclear 
grade 1/2 (OD04339) 


18.7 


11.3 


Colon ca.* 

(SW480 

met)SW620 


0 


83789 Kidney NAT 
(OD04339) 


4.9 


8.5 


Colon ca. 
IIT29 


0 


83790 Kidney Ca, Clear cell 
type (OD04340) 


6.3 


14.8 


Colon ca. 
HCT-116 


0 


83791 Kidney NAT 
(OD04340) 


10.7 


14.6 


Colon ca. 
CaCo-2 


2.2 


83792 Kidney Ca, Nuclear 
grade 3 (OD04348) 


6 


7.5 


83219 CC Well 
to Mod Diff 
(OD03866) 


0 


83793 Kidney NAT 
(OD04348) . 


14.5 


16.5 
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Colon ca. 
HCC-2998 


0 


87474 Kidney Cancer 
(OD04622-01) 


15.4 


25 


Gastric ca.* 
(liver met)NCI- 
N87 


7.2 


87475 Kidney NAT 
(OD04622-03) 


2!3 


2.8 


Bladder 


2.5 


85973 Kidney Cancer 
(OD04450-01) 


3.6 


7.3 


Trachea 


4.9 


85974 Kidney NAT 
(OD04450-03) 


1.3 


13.3 


Kidney 


0 


Kidney Cancer Clontech 
8120607 


7.3 


5.2 


Kidney (fetal) 


7 


Kidney NAT Clontech 
8120608 


1-1 


3.6 


Renal ca. 
786-0 


0 


Kidney Cancer Clontech 
8120613 


2.8 


7.5 


Renal ca. 
A498 


1.5 


Kidney NAT Clontech 
8120614 


3.7 


8 


Renal ca. 
RXF393 


3 


Kidney Cancer Clontech 
9010320 


17.4 


14.1 


Renal ca, 
ACHN 


1.1 


Kidney NAT Clontech 
9010321 


16.4 


11.3 


Renal ca. 
UO-31 


0 


Normal Uterus GENPAK 
061018 


4.6 


5.2 


Renal ca; 
TK-10 


1.2 


Uterus Cancer GENPAK 
064011 


17.2 


32.4 


Liver 


0.6 


Normal Thyroid Clontech A+ 
6570-1 


17.3 


18.1 . 


Liver (fetal) 


8.8 


Thyroid Cancer GENPAK 
064010 


6.5 


1.1 


Liver ca. 

(hepatoblast) 

HcpG2 


24.6 


Thyroid Cancer 
INVITROGENA302152 


1.7 


1.6 


Lung 


1.3 


Thyroid NAT INVITROGEN 
A302153 : 


9.2 


17.7 


Lung (fetal) 


15.8 


Normal Breast GENPAK 
061019 


21.3 


30.1 


Lung ca. (small 
cell) LX-1 


1.5 


84877 Breast Cancer 
(OD04566) 


1.4 


1.5 


Lung ca. (small 
cell) NCI-H69 


4.2 


85975 Breast Cancer 
(OD04590-01) 


12.9 


10.1 


Lung ca. (sxell 
var.) SHP-77 


1.4 


85976 Breast Cancer Mets 
(OD04590-03) 


100 


88 . 


Lung ca. (large 
cell)NCI-H460 


0 


87070 Breast Cancer 
Metastasis (OD04655-05) 


32.8 


0 


Lung ca. (non- 
sm. cell)A549 


2.9 


GENPAK Breast Cancer 
064006 


25 


28.5 


Lung ca. (non- 
s.cell)NCI-H23 


9.5 


Breast Cancer Res. Gen. 1024 


51.8 


66.3 
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Lung ca (non- 
s.cell) HOP-62 


3.4 


Breast Cancer Clontech 
9100266 


25 


25.1 


Lung ca. (non- 
s.cl) NCI-H522 


0 


Breast NAT Clontech 9100265 


18.2 


31.7 


Lung ca. 
(squam.) SW 
900 


4.7 


Breast Cancer INVITROGEN 
A209073 


15 


21 


Lung ca. 
(squam.) NCI- 
H596 


0 


Breast NAT INVITROGEN 
A2090734 


16.2 


12.3 


Mammary 
gland 


0 


Normal liver GENPAK 
061009 


5.6 


4.9 


Breast ca.* (pi. 
effusion) MCF- 
7 


0.7 


Liver Cancer GENPAK 
064003 


5.4 


11.8 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


6.1 


Liver Cancer Research 
Genetics RNA 1025 


3.7 


3.6 


Breast ca.* (pi. 

effusion) 

T47D 


5.1 


Liver Cancer Research 
Genetics RNA 1026 


4.8 


5.5 


Breast ca. 
BT-549 


2.8 


Paired Liver Cancer Tissue 
Research Genetics RNA 6004- 
T 


10.6 


7 


Breast ca. 
MDA-N 


0 


Paired Liver Tissue Research 
Genetics RNA 6004-N 


19.3 


28.4 


Ovary 


2.7 


Paired Liver Cancer Tissue 
Research Genetics RNA 6005- 
T 


4.6 


6 


Ovarian ca. 
OVCAR-3 


3.2 


Paired Liver Tissue Research 
Genetics RNA 6005-N 


5 


0 


Ovarian ca. . 
OVCAR-4 


3.7 


Normal Bladder GENPAK 
061001 


18.3 


11.5 


Ovarian ca. 
OVCAR-5 


2.5 


Bladder Cancer Reseiarch 
Genetics RNA 1023 


7.9 


21.4 . 


Ovarian ca. 
OVCAR-8 


1.7 


Bladder Cancer 
INVITROGEN A302 173 


33 


18.9 


Ovarian ca. 
IGROV-1 


0 


87071 Bladder Cancer . 
(OD04718-01) 


11 


18.5 


Ovarian ca.* 
(ascites) SK- 
OV-3 


1.8 


87072 Bladder Normal 
Adjacent (OD04718-03) 


9.4 


9.7 


Uterus 


24.6 


Normal Ovary Res. Gen. 


11.6 


11.3 


Placenta 


2.8 


Ovarian Cancer GENPAK 
064008 


23.3 


22.9 


Prostate 


2.5 


87492 Ovary Cancer 
(OD04768-07) 


80.1 


100 


Prostate ca.* 
(bone met)PC-3 


26.5 


87493 Ovary NAT 
(OD04768-08) 


3.8 


6.4 
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Testis 


0 


Normal Stomach GENPAK 
061017 


15.9 


17.1 


Melanoma 
HsC88(A).T 


0 


Gastric Cancer Clontech 
9060358 


36.3 


45.8 


Melanoma* 
(met) 

Hs688(B).T 


'0 


NAT Stomach Clontech 
9060359 


44.4 


60.8 


Melanoma 
UACC-62 


1.2 


Gastric Cancer Clontech 
9060395 


13.8 


20.4 


Melanoma 
M14 


0 


NAT Stomach Clontech 
9060394 


78.5 


93 


Melanoma 
LOXIMVI 


0.6 


Gastric Cancer Clontech 
9060397 


5.8 


11.5 


Melanoma* 
(met) SK- 
MEL-5 


4.1 


NAT Stomach Clontech 
9060396 


25.5 


30.1 


Adipose 


2 


Gastric Cancer GENPAK 
064005 


35.1 


27.6 



Table 14. Panel 3D 



Tissue Name 


Rel. Expr., % 
3dtm5227f_agl605 


94905_Daoy_Medulloblastoma/Cerebellum_sscDNA 


0.0 


94906_TE671_Medulloblastom/Cerebellum_sscPNA 


0.4 


94907_D283Med_MedulIoblastoma/Cerebellum_sscIWA 


0.0 


94908_PFSK-l_Primitive 
Neuroectodermal/Gerebellum sscDNA 


0.0 


94909_XF-498_CNS_sscDNA 


0.0 


94910_SNB-78_CNS/glioma_sscDNA 


0.0 


949irSF-268_CNS/glioblastoma_sscDNA 


. 0.0 


94912 T98G Glioblastoma sscDNA 


0.0 


96776_SK-N-SH_Neuroblastoma (metastasis)_sscDNA 


0.0 


94913_SF-295_CNS/gIioblastoma_sscDNA 


02 


94914 Cerebellum sscDNA 


1.9 


96777_Cerebellum_sscDNA 


3.4 


94916_NCI-H292_Mucoepidermoid lung carcinoma_sscDNA 


0.4 


94917_DMS-1 14_Small cell lung cancer_sscDNA 


0.0 


94918_DMS-79_Small cell lung 
cancer/neuroendocrine sscDNA 


3.8 


94919_NCI-H146_Small cell lung 
cancer/neuroendocrine sscDNA 


0.0 


94920_NCI-H526_Small cell lung 
cancer/neuroendocrine sscDNA 


14.6 


94921_NCI-N417_Small cell lung 
cancer/neuroendocrine sscDNA 


0.0 
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94923_NCI-HS2_SmalI cell lung 
cancer/neuroendocrine sscDNA 


0.2 


94924_NCI-H1 57__Squamous cell lung cancer 
(metastasis) sscDNA 


0.0 


94925JMC1-H1 1 55_Large cell lung 
cancer/neuroendocrine sscDNA 


0.0 


94926_NCI-H1299_Large cell lung 
cancer/neuroendocrine sscDNA 


0.0 


y4927_NCI-H727_Lung carcmoid_sscDNA 


0.5 


9492S_NCI-UMC-1 l_Lung carcinoidsscDNA 


0.6 


94929_LX-1 Small cell lung cancersscDNA 


0.2 


94930_CoIo-205_Colon cancer_sscDNA 


0.0 


94931 KM12 Colon cancer sscDNA 


0.0 


94932 KM20L2 Colon cancer sscDNA 


0.0 


94933 NCI-H716 Colon cancer sscDNA 


0.0 


94935_SW-48_Colon adenocarcmoma_sscDNA 


0.1 


y4936_SW 1 1 16_Colon adenocarcmoma_sscDNA 


0.0 


94937 LS 174T Colon adenocarcinoma sscDNA 


0.0 


94938 SW-948 Colon adenocarcinoma sscDNA 


0.0 


94939 SW-480 Colon adenocarcinoma sscDNA 


0.0 


94940_NCI-SNU-5_Gastnc carcinoma_sscDNA 


0.2 


94941 KATO ITT Gastnc carcinoma sscDNA 


0.4 


94943 NCI-SNU-16 Gastric carcinoma sscDNA 


0.0 


94944 NCI-SNU-1 Gastnc carcinoma sscDNA 


6.0 


94946_RF-l_Gastne adenocarcmoma__sscDNA 


15.9 


94947_RF-48_Gastncadenocarcinoma_sscDNA 


18.7 


96778 MKN-45 Gastnc carcinoma sscDNA 


0.0 


94949 NCI-N87 Gastric carcinoma sscDNA 


0.0 


9495 l_0VCAR-5_0varian carcinoma_sscDNA 


0.0 


94952 JlL95-2JUterine carcinoma_sscDNA 


0.0 


94953_HelaS3_Cervical adenocarcinoma sscDNA 


0.0 


94954^Ca Ski_Cervical epidermoid carcinoma 
(metastasis)_sscDNA 


0.2 


94?55_ES-2_Ovarian clear cell carcinoma_sscDNA 


0.0 


94_957_Ramos/6h stim "; Stimulated with PMA/ionomycin 
6h sscDNA 


14.7 


94958Jfcamos/14h stim "; Stimulated with PMAAonomycin 
14h sscDNA 


22.7 


94962_MEG-01_Chronic myelogenous leukemia 
(megokaiyoblast) sscDNA 


0.5 


y4yoj_Kaji_^£jurtitts Jymphoma_sscDNA 


32.3 


94964_Daudi_Burkitt , s lymphoma_sscDNA 


100.0 


94965__U266_B-cell plasmac)rtoma/myeloma_sscDNA 


0.3 


94968_CA46_Burkitt , slymphoma_sscDNA 


22.5 


94970_RL_non-Hodgkin's B-cell lymphoma^sscDNA 


29.5 
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y+zf / jjvij ui c d ten jyijjuijtiina/ louivt-iiiiti .\itj/iiA 


23.5 


0407^ TurVat T r*»ll 1#»iitpmfn qcpTYWA 


2.2 


04074 TP-1 T7rvtVirrtl*»iilrpmin ccrTYWA 
7*t7/M lily UUUlCUKClLlIa oal/J-zINrt. 


0.3 




1.6 


04077 7T0T7 Wicfinrvtir IvmiVhrtmn «rFVNA 


7.3 




0.1 


94981 769-P Clear cell renal carcinoma sscDNA 


0.2 


94983 Caki-2 Clear cell renal carcinoma sscDNA 


0.1 


94984 SW839 Clear cell renal carcinoma sscDNA 


1.0 


94986_G401_Wilms t tumorjsscDNA 


0.0 


94987_Hs766TJPancreatic carcinoma (LN metastasis)_sscDNA 


0.1 


949S8_CAPAN-l_Pancreatic adenocarcinoma (liver 
metastasis) sscDNA 


0.0 


94989_SLT86.86_Pancreatic carcinoma (liver 
metastasis) sscDNA 


A A 

0.0 


94990 BxPC-3 Pancreatic adenocarcinoma sscDNA 


A 1 

O.l 


94991 HPAC Pancreatic adenocarcinoma sscDNA 


A i 

0.1 


94992 MIA PaCa-2 Pancreatic carcinoma sscDNA 


A A 

0.0 


94993_CFPAC-l_Pancreatic ductal adenocarcinoma_sscDNA 


A O 

0.2 


94994_PANC-l_Pancreatic epithelioid ductal 
carcinoma sscDNA 


A A 

0.0 


0400^ T74 "RlnHH^r r*nrr*inTn?( ftrancitinTinl ppI^ Q<;rT*)NA 
y^rJJyj DluUUCI (^ctlvlllllla ^LlallblUUJlal l>CUJ oai/t/iNn. 


0.0 


04QQ7 ^fJXl R1ar1H*»r r-flrrrnnTnn ccpTYWA 

ynyy / joj/- i3iau.ucr carvinuiua c>ovj-/i>/\. 


0.0 




0.0 


94999_UM-UC-3_Bladder carcinma (transitional cell)_sscDNA 


0.0 . . 


95000_A204_Rhabdomyosarcoma_sscDNA 


A A 

0.0 


95001 HT-1080 Fibrosarcoma sscDNA 


0.0 


95002_MG-63_Osteosarcoma (bone)_sscDNA 


A 1 

0.1 


95003_SK-LMS-l__Leiomyosarcoma (vulva)_sscDNA 


0.2 


95004_SJRH30_Rhal?domyosarcoma (met to bone 
marrow) sscDNA 


A A 

0.0 


yj\J\JO J\Hj i__XipjUClUi01.0 CartinUIDa^&dUJ-'.Pi.rt. 


0.0 


ncnn7 W7Af7A/x_4 \Ao1ftTiAma ecoFYW A. 


00 


95010 DU 145 Prostate carcinoma (brain metastasis) sscDNA 


0.0 


950 1 2_MDA-MB-468_Breast a denocarcinoma_sscDNA 


0.0 


95013_SCC-4_Squamous cell carcinoma of tongue_sscDNA 


0.0 • 


95014_SCC-9_Squamous cell carcinoma of tongue_sscDNA 


0.3 


95015_SCC-15_Squamous cell carcinoma of tongue_sscDNA 


0.0 


95017_CAL 27_Squamous cell carcinoma of tongue_sscDNA 


0.0 



Hie RTQ-PCR analysis (Table 13 and 14) reveals that MOL1 is predominantly expressed 
in cell lines derived from lymphoma and leukemia, specifically Burkitt ! s lymphoma in panel 3D. 
This result is supported by the presence of GenBank ESTs coming from T cells from T cell 
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leukemia (see Unigene http://www.ncbi.nlm.nih.gov/UniGene/clust.cgi?ORG=Hs&CID=87968). 
Recent report indicates that this receptor nomally aids the immune cells to sense the presence of 
unmethylated CpG dinucleotides (Hemmi H, Takeuchi O, Kawai T, Kaisho T, Sato S, Sanjo H, 
Matsumoto M, Hoshino K, Wagner H, Takeda K, Akira S. A 'l oll-like receptor recognizes 
bacterial DNA. Nature 2000 Dec 7;408(6813):740-5) and to induce proliferation of splenocytes, 
inflammatory cytokine production from macrophages and maturation of dendritic cells. The 
signaling pathway mediated by toll-like receptor 9 is through the activation of NF-kB. There is 
evidence that NF-kappaB activity is necessary for survival of lymphoma and leukemia cells 
(Constitutive activation of NF-kappaB in primary adult T-cell leukemia cells. Mori N, Fujii M, 
Dceda S, Yamada Y, Tomonaga M, Ballard DW, Yamamoto N. Blood 1999 Apr 1 ;93(7):2360-8; 
Inhibition of NF-kappaB induces apoptosis of KSHV-infected primary effusion lymphoma cells. 
Keller SA, Schattner EJ, Cesannan E. Blood, 1 October 2000, 96, No. 7, pp. 2537-2542). 
Overexpression of toll-like receptor 9 by lymphoma and leukemia cells is likely to mediate 
ligand-independent signaling, affecting the normal processes of activation, proliferation and 
tumorogenesis. Therefore the protein encoded (GMJ79960178) may serve as a potential marker 
for lymphoma and leukemia cells. In addition, human monoclonal antibodies directed against 
this protein could be therapeutics for the treatment of lymphoma and leukemia. 

MOL2 

Expression of gene MOL2 was assessed using the primer-probe set Ag743, described in 
Table 1 5. Results of the RTQ-PCR runs are shown in Table 16. 



Table 15, Probe Name: Ag743 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 
NO: 


Forward 


5-ATGTCTTGGTGGATGCAGAA-3' 


59.1 


20 


1304 


79 


Probe 


TET-5'- 

CGGACTATAGCATTTCTAAGCGCCTCG- 
3-TAMRA 


69.2 


27 


1340 


80 


Reverse 


5'-CACATCCTCCTTGCAAATGT-3 1 


58.6 


20 


1370 


81 



Table 16. Panels 13D and 4D 



PANEL 1.3 D 




PANEL 4D 
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Tissue Name 


Rel. Expr., % 
L3dx4tm5604t ag743 
al 


rissue Name 


Rel. Expr., % 

A HtwOyl "7*7* 0 .-,""7 

4Dtmz4 / /t ag/ 
43 


Liver 

adenocarcino 
ma 


36.4 


*3768_Secondary Tnl_anti- 
CD28/anti-CD3 


11 .1 


Pancreas 


4.6 


?3769_Secondary Th2_anti- 

oiJZo/dnu-LyJ J j 


19.6 


Pancreatic ca. 
CAP AN 2 


16.6 


93770_Secondary Trl_anli- 

HTV5 0 /««♦; f^Pkl 

^uJ. of anti-cu.5 


23.2 


Adrenal gland 


6.5 


?3573_Sccondary Thl_resting 
clay 4-o m il-z 


7.2 


Thyroid 


5.2- 


93572_Secondary Th2_resting 
day 4~o in ll-z 


7.1 


Salivary gland 


12.0 


93571_Secondary Trl_resting 

J~,, A £ TT n 

day 4-0 in il-z 


11.9 


Pituitary gland 


22.7 


93568_primary Thl_anti- 
L>vz o/an n-ciJ j 


43.2 


Brain (fetal) 


8.4 


93569_primary Th2_anti- 
oiA-io/anu-CJJ.* 


32.1 


Brain (whole) 


38.6 


93570j)riinary Trl_anti- 
CDzo/ann-uLo 


50.0 


Brain 

(amygdala) 


24.8 


93565 jmmaryThl resting dy 
4-6 inIL-2 


43.8 


Brain 

(cerebellum) 


31.4 


93566_primary Th2_resting dy 
4-6inIL-2 


10.4 


Brain 

(hippocampus) 


22.0 ■ 


93567j>rimary Trl_resting dy 
4-6iriIL-2 


22.4 


Brain 

(substantia 

nigra) 


11.8 


93351_CD45RA CD4 
lymphocyte anh-CD28/anti- 
CD3 


23.0 


Brain 
(thalamus) 


30,3 


93352_CD45RO CD4 
lymphocyte anu-CD28/anti- 
CD3 


25.5 


Cerebral 
Cortex 


22.0 


9325l_CD8 Lymphocytes_anti- 
CD28/anti-CD3 


16.0 


Spinal cord 


20.6 


93353_chronic CD8 
Lymphocytes 2ry_resting dy 4-6 
in IL-2 


15.0 


CNSca. 

(glio/astro) 

U87-MG 


28.1 


93574_chronic CD8 
Lymphocytes 2ry_activated 
CD3/CD28 


10.2 


CNS ca. 

(glio/astro) 

U-118-MG 


27.9 


93354_CD4_none 


4.0 


CNS ca. 
(astro) . 
SW1783 


31.1 


93252 Secondary 
Ilil/Th2/Trl_anti-CD95 CH11 


9.6 


CNS ca * 


19,3 


93 1 03JLAK cells_jesting 


12.9 
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(neuro; met ) 
SK-N-AS 








CNS ca. 

(astro) 

SF-539 


47.2 


9378SJLAK cells_IL-2 


14.9 


CNS ca. 

(astro) 

SNB-75 


10.2 


93787_LAK cells_IL-2+lL-12 


12.5 


CNS ca. (glio) 
SNB-19 


24.3 


93789_LAK cel1s_IL-2+lFN 

gamma 


22.2 


CNS ca. (glio) 
U251 


20.9 


93790JAK cells JL-2+ IL-1 8 


15.7 


CNS ca. (glio) 
SF-295 


10.7 


93104_LAK 

cells PMA/ionomycin and IL-18 


4.0 


Heart (fetal) 


2.0 


93578_NK Cells IL-2_resting 


12.0 


Heart 


6.5 


93109_Mixcd Lymphocyte 
Reaction Two Way MLR 


9.8 


Fetal Skeletal 


0.7 


93 1 10_Mixed Lymphocyte 
Reaction Two Way MLR 


14.3 


Skeletal 
muscle 


18.3 


93 1 1 l_Mixed Lymphocyte 
Reaction Two Way MLR 


11.7 


Bone marrow 


11.2 


931 12_Mononuclear Cells 
(PBMCs) resting 


5.8 


Thymus 


7.8 


931 13_Mononuclear Cells 
(PBMCs) PWM 


51.8 


Spleen 


7.2 . . 


931 14_Mononuclear Cells 
(PBMCs)_PHA-L . 


27.5 


Lymph node 


. 7.5 


93249^_Ramos (B cell)_none 


0.2 


Colorectal 


28.4 


93250_Ramos (B 
cell) ionomycin 


6.7 


Stomach 


6.5 




100 0 


Small intestine 


10.0 


93350JB lymphbytes_CD40L 
andIL-4 


23.7 


Colon ca. 
SW480 


23.2 


92665_EOL-l 

(Eosinophil)_dbcAMP 

differentiated 


15.5 


Colon ca* 

(SW480 

met)SW620 


12.3 


93248_EOL-l 

(Eosinophil)_dbcAMP/PMAion 
omycin 


10.4 


Colon ca. 
HT29 


1.7 


93356_Dendritic Cellsnone 


15.3 


Colon ca. 
HCT-116 


13.1 


93355_Dendritic Cells_LPS 100 
ng/ml 


6.7 


Colon ca. 
CaCo-2 


12.5 


93775 Dendritic Cells anti- 
CD40 


16.7 


83219 CC 
Well to Mod 


10.4 


93774^Mohocytes_resting 


14.9 
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Diff 

(OD03866) 








Colon ca. 
HCC-299S 


11.3 


93776_Monocytes__LPS 50 
ng/ml 


4.0 


Gastric ca.* 
(liver met) 
NCI-N87 


17.6 


9358 l_Macrophages_resting 


19.5 


Bladder 


17.6 


93582 Jvlacrophages_LPS 100 
ng/ml 


7.5 


Trachea 


7.6 


93098_HUVEC 
(Endothelial) none 


33.0 


Kidney 


5.7 


93099JHUVEC 
(Endothelial)starved 


66.0 


Kidney (fetal) 


6.4 


93100JIUVEC 
(Endothelial^ IL-lb 


29.3 


Renal ca. 
786-0 


10.0 


93779 JIUVEC 
(Endothelial)_IFN gamma 


33.0 


Renal ca. 
A498 


23.6 


93102 JIUVEC 

(Endothelial)_TNF alpha + IFN 
gamma 


23.0 


Renal ca. 
RXF393 


37.8 


93101 HUVEC 
(Endothelial), TNF alpha + IL4 


19.9 


Renal ca. 
ACHN 


18.1 


93781 HUVEC 
(Endothelial) IL-11 


22.4 


Renal ca. 
UO-31 


26.1 


93583_Lung Microvascular 
Endothelial Cells none 


27.5 


Renal ca. 
TK-10 


10.9 


93584_Lung Microvascular 
Endothelial Cells^TNFa (4 
ng/ml) and ILlb (1 ng/ml) 


25.2 


Liver 


2.2 


92662_Microvascular Dermal 
endothelium none 


51.4 


Liver (fetal) 


4.4 


92663_Microsvasular Dermal 
endothelium_TNFa (4 ng/ml) 
and ILlb (1 ng/ml) 


22.5 


Liver ca. 

(hepatoblast) 

HepG2 


28.1 


93773_Bronchial 

epithelium TNFa (4 ng/ml) and 

ILlb (1 ng/ml) ** 


17.0 


Lung 


12.4 


93347_Small Airway 
Epitheliumnone 


20.9 


Lung (fetal) 


5.2 


93348_Small Airway 
Epithelium TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


84.7 


Lung ca. 
LX-1 


10.8 


92668_Coronery Artery 
SMC res tine 


37.9 


Lung ca. 
(small cell) 
NCI-H69 


20.2 


92669 Coronery Artery 

SMC TNFa (4 ng/ml) and ILlb 

(1 ng/ml) 


26.8 


Lung ca. 
ii \ 


20.1 


93 1 07_astrocytes_resting 


47.6 
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(sxell var.) 
SHP-77 








Lung ca. 
(large 
ce11)NCI- 
H460 


10.7 


93108_astrocytes_TNFa (4 
ng/ml) and ILlb (1 ng/ml) 


20.0 


Lung ca. (non- 
sm. cell) A549 


5.5 


92666_KU-812 
(Basophil)_resting 


25.0 


Lung ca. (non- 
s.cell) NCI- 
H23 


13.2 


92667_KU-812 
(Basophil)_PMA/ionoycin 


31.9 


Lung ca (non- 
s.cell) HOP- 
62 


18.3 


93579_CCD1106 
(Keratinocytes)_none 


47.0 


Lung ca. (non- 
sxl) NCI- 
H522 


13.8 


93580_CCD1106 

(Keratinocytes)_TNFa and EFNg 
** 


7.2 


Lung ca. 
(squam.) 
SW 900 


17.2 


93791_Liver Cirrhosis 


4.2 


Lung ca. 
(squam.) 
NCI-H596 


20.9 


93792_Lupus Kidney 


3.8 


Mammary 
gland 


17.8 


93577_NCI-H292 


55.1 


Breast ca.* 
(pi. effusion) 
MCF-7 


20.4 


93358_NCI-H292_IL-4 


68.8 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


22.5 


93360 JTCI-H292JL-9 


59.5 


Breast ca.* 
(pi. effusion) 
T47D 


15.5 


93359_NCI-H292_IL-13 


. 35.1 


Breast ca. . 
BT-549 


13.9 


93357_NCI-H292_EFN gamma 


37.9 


Breast ca. 
MDA-N 


9.9 


93777_HPAEC_- 


32.3 


Ovary 


5.6 


93778_HPAEC_IL-1 beta/TNA 
alpha 


23.2 


Ovarian ca. 
OVCAR-3 


15.4 


93254_Normal Human Lung 
Fibroblast none 


29.7 


Ovarian ca. 
OVCAR-4 


12.4 


93253_Normal Human Lung 
Fibroblast TNFa (4 ng/ml) and 
IL-lb (1 ng/ml) 


16.8 


Ovarian ca. 
OVCAR-5 


27.2 


93257_Normal Human Lung 
Fibroblast IL-4 


55.5 


Ovarian ca. . 
OVCAR-8 


9.0 


93256_Normal Human Lung 
Fibroblast EL-9 . 


34.9 
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Ovarian ca. 
IGROV-1 


22.6 


93255_Normal Human Lung 
Fibroblast IL-13 


36.9 


Ovarian ca.* 
(ascites) SK- 
OV-3 


100.0 


93258_Normal Human Lung 
FibroblastJFN gamma 


58.6 


Uterus 


*,1 


93106_Dermal Fibroblasts 
CCD 1070 resting 


74.2 


Plancenta 


15.8 


93361_Dermal Fibroblasts 
CCD 1070 TNF alpha 4 ng/ml 


64.2 


Prostate 


5.6 


93105 Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/ml 


27.7 


Prostate ca.* 
(bone met)PC- 

3 


50.1 


93772 dermal fibroblast_IFN 
gamma 


24.0 


Testis 


8.3 


93771 dermal fibroblast IL-4 


58.2 


Melanoma 
Hs688(A).T 


12.6 


93259_BD Colitis 1** 


2.8 


Melanoma* 
(met) 

Hs688(B).T 


14.5 


93260JBD Colitis 2 


LI 


Melanoma 
UACC-62 


18.8 


9326 1_IBD Crohns 


2.1 


Melanoma 
M14 


19.7 


73501 0_Colon_normal 


14.3 


Melanoma 
LOXIMVT 


8.8 


7350 1 9JLung_none 


21.5 


Melanoma* 
(met) SK- 
MEL-5 


11.0 


64028- lJITiymus_none 


27.0 


Adipose 


14.5 


64030- l_Kidney_none 


31.2 



MOL2 is widely expressed in tissues and cell lines represented in both panels 1 .3D and 
4D, with highest expression being in one ovarian cancer cell line (SK-OV-3). Thus, it could 
serve as a diagnostic marker for ovarian cancer. 



MOL3 

Expression of MOL3 was assessed using the primer-probe sets Ag474 and Ag770, 
described in Tables 1 7 and 1 8. Results of the RTQ-PCR runs are shown in Tables 19 and 20. 

10 Table 17. Probe name: Ag474 
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WO 01/81578 



PCT/US01/I3578 



Primers 




T jf*t\ vlri 


Start 

Position 


NO: 


Forward 


5'-GGCACTGTCCCTCTGCACAT-3' 


20 


493 


82 


Probe 


FAM-5'- 

CCCTGAGAAAGATCTGCCACAAAGACATCT 
G-3-TAMRA 


31 


516 


83 


Reverse 


5'-AACCTGCCCACAGAGCAATC-3' 


20 


549 


84 



Table 18. Probe name: Ag770 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 

Positio 

n 


Forward 


5'-AC AGTGCTGTCGCTGGTACTT-3 ' 


60 


21 


717 


Probe 


FAM-5'- 

TTCGTACTGAAAGGCGTACCTCTCCA-3 , . 
TAMRA 


67.9 


26 


746 


Reverse 


5'-CTCAAACAGCTCACG AGTG AT-3 ' 


58.1 


21 


773 



5 Table 19. Ag474 



PANEL 1.3D 




PANEL 2D 




Tissue Name 


Rel. Hxpr., % 
1.3Dtm3254f_ag474 


Tissue.Name 


Rel. Expr., % 
2Dtm3255f ag47 
4 


Liver 

adenocarcinoma 


0.0 


Normal Colon GENPAK 
061003 


0.3 


Pancreas 


0.0 


83219 CC Well to Mod 
Diff(OD03866) 


0.0 


Pancreatic ca. 
CAP AN 2 . 


0.0 


83220 CC NAT 
(OD03866) . 


0.0 


Adrenal gland 


0.3 


83221 CC Gr.2 
rectosigmoid (OD03868) 


0.0 


Thyroid 


0.0 


83222 CCNAT 
(OD03868) 


0.0 


Salivary gland 


0.0 


83235 CC Mod Diff 
(ODO3920) 


0.0 


Pituitary gland 


0.0 


83236 CCNAT 
(ODO3920) 


0.0 


Brain (fetal) 


0.2 


83237 CC Gr.2 ascend 
colon (OD03921) 


0.0 


Brain (whole) 


0.0 


83238 CCNAT 
(OD03921) 


0.0 


Brain (amygdala) 


0.1 


83241 CC from Partial 
Hepatectomy (ODO4309) 


3.8 


Brain (cerebellum) 


0.0 


83242 Liver NAT 
(ODO4309) 


100.0 
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WO 01/81578 
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Brain 

(hippocampus) 


0.0 


i/4 /z Colon mets to lung 
(OD04451-01) 


a a 
u.u 


Brain (substantia 
nigra) 


0.0 


37473 Lung NA J 
(OD04451-02) 


A A 

U.U 


Brain (thalamus) 


0.0 


Normal Prostate Clontech 
\+ 6546-1 


A A 
U.U 


Cerebral Cortex 


0.1 


*4140 Prostate Cancer 
(OD04410) 


A A 
U.U 


Spinal cord 


0.0 


84141 Prostate NAT 
OD04410) 


0.0 


CNS ca. (glio/astro) 

i ion x gr* 

U87-MG 


0.3 


87073 Prostate Cancer 

/ r /"YT > *Ay1TJA A1\ 

OD04720-U1) 


0.0 


CNS ca. 

(gho/astro) U-118- 
MG 


0.0 


87074 Prostate NAT 
(OD04720-02) 


0.0 


CNS ca. (astro) 
SW1783 


0.0 


Normal Lung GENPAK 
061010 


A A 

U.U 


CNS ca.* (neuro; 
met ) SK-N-AS 


0.0 


i>32 jy Lung Met to Muscle 
(OD04286) 


a a 
U.U 


CNS ca. (astro) 
SF-539 


0.0 


53240 Muscle IMA J 
(OD04286) 


a a 

U.U 


CNS ca. (astro) 
SNB-75 


0.0 


84136 Lung Malignant 
Cancer (OD03126) 


A A 

U.U 


CNS ca. (glio) 
SNB-19 


0.0 


Oji 1 T7 T XT A T 

54137 Lung NAT 
(OD03126) 


A A 
U.U 


CNS ca. (glio) 
U251 


0.0 


84871 Lung Cancer 
(OD04404) 


A A 
U.U 


CNS ca. (glio) 
SF-295 


0.0 


O A O T XI A T" 

84872 Lung NAT 
(OD04404) 


A A 
U.U 


Heart (fetal) 


0.0 


648 75 Lung Cancer 
(OD04565) 


A A 
U.U 


Heart 


' A A 

0.0 


84576 Lung MAI 
(OD0A565) 


A A 
U.U 


Fetal Skeletal 


0.2 


85950 Lung Cancer 
(OD04237-01) 


A A 
U.U 


Skeletal muscle 


0.0 


85970 Lung NAT 
(OD04237-02) 


A A 

0.0 


Bone marrow 


0.0 


83255 Ocular Mel Met to 
Liver (ODO4310) 


A A 

U.U 


Thymus 


0.0 


OO T XT A T 

83256 Liver NA1 
(ODO4310) 


t>4.Z 


Spleen 


0.2 


84139 Melanoma Mets to 
Lung (OD04321) 


A A 
U.U 


Lymph node 


A A 

u.u 


ffjll^fi T urtrr "MAT 

OH I JO i^ungj\Ai 
(OD04321) 


OA 


Colorectal 


0.0 


Normal Kidney GENPAK 
061008 


0.0 


Stomach 


0.1 


83786 Kidney Ca, 
Nuclear grade 2 


0.0 
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(OD04338) 




Small intestine 


0.0 


83787 Kidney NAT 
(OD04338) 


0.0 


Colon ca. 
SW480 


0.3 


83788 Kidney Ca Nuclear 
grade 1/2 (OD04339) 


0.0 


Colon ca* (SW480 
met)SW620 


0.0 


83789 Kidney NAT 
(OD04339) 


0.0 


Colon ca. 
HT29 


0.0 


83790 Kidney Ca, Clear 
cell type (OD04340) 


0.1 


Colon ca. 
HCT-116 


0.0 


83791 Kidney NAT 
(OD04340) 


0.0 


Colon ca. 
CaCo-2 


0.3 


83792 Kidney Ca, Nuclear 
grade3(OD04348) 


0.0 


83219 CC Well to 
Mod Diff 
(OD03866) 


0.3 


83793 Kidney NAT 
(OD04348) 


0.0 


Colon ca. 
HCC-2998 


0.5 


87474 Kidney Cancer 
(OD04622-01) 


0.0 


Gastric ca.* (liver 
met)NCI-N87 


0.0 


87475 Kidney NAT 
(OD04622-03) 


0.0 


Bladder 


0.0 


85973 Kidney Cancer 
(OD04450-01) 


0.0 


Trachea 


0.0 


85974 Kidney NAT 
(OD04450-03) 


0.1 


Kidney " 


0.0 


Kidney Cancer Clontech 
8120607 • 


0.0 


Kidney (fetal) 


0.0 . 


Kidney NAT Clontech 
8120608 


0.0 


Renal ca. 
786-0 


0.0 


Kidney Cancer Clontech 
8120613 


0.0 


Renal ca. 
A498 


0.0 


^idney NAT Clontech 
8120614 


0.0 


Renal ca. 
RXF393 


0.0 


Kidney Cancer Clontech 
9010320 


o.a 


Renal ca. 
ACHN 


0.0 


Kidney NAT Clontech 
9010321 


0.0 


Renal ca. 
UO-31 


0.3 


Normal Uterus GENPAK 
061018 


0.0 


Renal ca. 
TK-10 


0.1 


Uterus Cancer GENPAK 
064011 


0.0 


Liver 


100.0 


Normal Thyroid Clontech 
A+ 6570-1 


0.0 


Liver (fetal) 


1.8 


Thyroid Cancer GENPAK 
064010 


0.0 


Liver ca. 

(hepatoblast) 

IIepG2 


0.0 


Thyroid Cancer 
INVITROGEN A302 1 52 


0.0 


Lung 


0.0 


Thyroid NAT 
INVITROGEN A302153 


0.0 
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Lung (fetal) 


n ft 


LNlsIlildl Dlvaai Uulll IV 

061019 


0 0 


Lung ca. (small 
cell) LX-1 


n n 
u.u 


Rrf»acl PariPPr 

(OD04566) 




Lung ca. (small 
cell) NCI-H69 


A A 

u.u 


ojy /-> isreast oancer 
(OD04590-01) 




Lung ca. (s.cell 
var.) SHP-77 


n a 
u.u 


03y /o oreasi L^ancei ivieis 
(OD04590-03) 




Lung ca. (large 
cell)NCI-H460 


ft ft 
u.u 


R*7ft7ft *Orf*<ict Pflnrpr 

Metastasis (OD04655-05) 


0 0 


Lung Ca. ^nUil'MU. 

cell) A549 


n i 

U. 1 


064006 


0 2 


JLung ca. (non- 
s.cell) NCI-H23 


ft 0 
u.z 


DlCaM L<aUCCi lYCs. VjCJl. 

1024 


0 1 


Lung ca (non-s.cell) 
HOP-62 


ft ft 
U.U 


ureasx L/ancer ^/loniecn 
9100266 




Lung ca. vnon-s.ci j 
NCI-H522 


ft ft 

u.u 


U rAac f KJ AT r K \r\*ntf>r~\-k 

rsreasi in/\ i Lioniecn 
9100265 


0 0 


I linn /% #i / P/iiiom i 

Lung ca. (squarn.j 
SW900 


ft ft 


i>rcasi dancer 
INVITROGEN A209073 


0 0 


Lung ca. (squarn.j 
NCI-H596 


ft ft 
u.u 


"Rrf»ac* MAT 

INVITROGEN A2090734 


0 0 


Mammary gland 


A ft 

u.u 


iNorrnaJ L/iver vjhini /yjv 
061009 




breast ca. (pi. 
effusion) MCF-7 


A A 


Liver L-ancer ol#jnx\ajs. 
064003 


AQ ft 


Breast ca.* (pl.ef) 

MDA-iVLD-ZJ 1 


0.0 


Liver Cancer Research 
vjenciics xnjna iuz.j 


62.4 


Breast ca.* (pi. 
effusion) T47D 


0.0 


Liver Cancer Research 
Genetics RNA 1026 


12.9 


Breast ca. 
BT-549 


A A 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6004-T 




Breast ca. 
MDA-N 


A 1 

0.1 


Paired Liver Tissue 
Research Genetics RNA 
6004-N 




Ovary • 


0.0 


Paired Liver Cancer Tissue 
Research Genetics RNA 

£ftfK T 
OUUJ-l 




Ovarian ca. 
OVCAR-3 


0.5 


Paired Liver Tissue 
ivcsearcn oencucs jvina. 
6005-N 


18.6 


Ovarian ca. 
OVCAR-4 


A A 

u.u 


MattyioI Vl\nAAt*r C^T*f»Jl> AV 

061001 


v.U 


Ovarian ca. 
OVCAR-5 


0.0 


Bladder Cancer Research 
Genetics RNA 1023 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bladder Cancer 
INVITROGEN A302173 


0.0 


Ovarian ca. 


0.0 


87071 Bladder Cancer 


0.0 
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Ovarian ca.* 

(ascites 


0.0 


87072 Bladder Normal 

AHinrAnt fnTVM71£ A^ 
rVUJauCIiL ^vyJL/U*r / 1 o~wjj 


0.0 


Uterus 


0.0 


Normal Ovary Res. Gen. 


0.0 


Placenta 


U.U 


uvanan cancer onjNr ajv 
064008 


U.U 


Prostate 


A 1 
U.l 


o /4yz uvary cancer 
(OD04768-07) 


u.l 


r lUoLalC Ca. ^UUllC 

met)PC-3 


A A 
U.U 


o uvary xnai 
(OD04768-08) 


n a 

U.U 


Testis 


A 1 
U.J 


jNormaj oiomacn otirMrAJs. 
061017 


U.U 


1V1C IdUUIIla 

Hs688(A).T 


A A 
U.U 


oasmc cancer ciomecn 
9060358 


A A 
U.U 


lVlClallUIUu ^HJCIJ 

Hs688(B).T 


A A 
U.U 


9060359 


A A 
U.U 


lYlCiallUIUa 

UACC-62 


A A 
U.U 


vjasmc cancer ciomecn 
9060395 


U.H 


IViCldllUIIla 

M14 


A A 


in/\i oiomacn cioniecn 
9060394 


A A 
U.U 


TV/f p 1 a nntn a 

LOXIMVI 


A A 


vjabinc cancer ^iuhiccq 
9060397 


U.J 


Melanoma* (met) 
SK-MEL-5 


0.0 


NAT Stomach Clontech 
9060396 


0.0 


Adipose 


0.0 


Gastric Cancer GENPAK 
064005 


0.0 



Table 20. Ag770 



PANEL 1.3D 




PANEL 4D 






Tissue Name 


Rel. Expr., % 
1.3dx4tm5495 
f_ag770_b2 


Tissue Name 


Rel. Expr., % 
4dtml843f ag7 
70 


Rel. Expr., 
% 

4Dtml910f 
ag770 


Liver 

adenocarcinoma 


. 0,0 


93768_Secondary 
Thl_anti-CD28/anti-CD3 


0.0 


. 0.0 


Pancreas 


0.0 


93769_Secondary 
Th2_anti-CD2 8/anti-CD3 


6.4 


0.0 


Pancreatic ca. 
CAP AN 2 


0.0 


93770_Secondary 
Trl_anti-CD28/anti-CD3 


3.1 


0.0 


Adrenal gland 


0.0 


93573_Secondary 

Thl resting day 4-6 in IL- 

2 


0.0 


0.0 


Thyroid 


0.0 


93572_Secondary 

Th2 resting day 4-6 in IL- 

2 


0.0 


0.0 


Salivary gland 


0.0 


93571_Secondary 
Trl_resting day 4-6 in IL-2 


0.0 


0.0 
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Pituitary gland 


0.0 


9356$_primary Thl_anti- 
CD2S/anti-CD3 


3.7 


0.0 


Brain (fetal) 


0.0 


93569_primary Th2_anti- 
CD28/anti-CD3 


0.0 


0.0 


Brain (whole) 


0.0 


93570_primary Trl_anti- 
CD28/anti-CD3 


0.0 


0.0 


Brain 

(amygdala) 


0.0 


93565__primary 
Thl_resting dy 4-6 in 1L-2 


0.0 


4.8 


Brain 

(cerebellum) 




7jjdo priiiidiy 

Th2_resting dy 4-6 in IL-2 


0 0 


2.2 


Brain 

(hippocampus) 


0.1 


93567_primary Trl_resting 
dy 4-6 in IL-2 


0.0 


7.0 


Brain 

(substantia 

nigra) 


0.0 


93351_CD45RA CD4 

lymphocyte_anti- 

CD28/anli-CD3 


3.1 


0.0 


Brain 
(thalamus) 


0.0 


93352_CD45RO CD4 
ymphocyte anti- 
CD28/anti-CD3 


0.0 


0.0 


Cerebral Cortex 


0.0 


93251_CD$ 

Lymphocytes_anti- 

CD28/anti-CD3 


1.4 


0.0 


Spinal cord 


0.0 


93353_chronic CD8 
Lymphocytes 2ry_resting 
day 4-6 in IL-2 


0.0 


0.0 


CNS ca. 

(glio/astro) 

U87-MG 


0.0 


93574_chroriic CD8 

Lymphocytes 

2ry activated CD3/CD28 


0.0 


0.0 


CNS ca. 
(glio/astro) U- 
118^MG : 


0.0 


93354_CD4_none 


0.0 


0.0 


CNS ca. (astro) 
SW1783 


0.0 


93252_Secondary 
Thl/Th2/Trl anti-CD95 
CH11 


0.0 


41.2 


C/JNo ca. 
(neuro; met ) 
SK-N-AS 






0 ft 


V.V 


CNS ca. (astro) 
SF-539 


0.0 


937S8_LAKcells_IL-2 


0.0 


0.0 


CNS ca. (astro) 
SNB-75 


0.0 


93787 LAK cells IL- 
2+EL-12 


0.0 


0.0 


^■PID va. ^JJilv^ 

SNB-19 


\J.\J 


9V7R9 T AK celk IT - 

2+IFN gamma 


0 0 


0.0 


CNSca.(glio) 
U251 


0.0 


93790 LAK cells IL-2+ 
IL-18 


0.0 


0.0 


CNS ca. (glio) 
SF-295 


0.0 


93104_LAK 

cells PMA/ionomycin and 
IL-18 


0.0 


5.9 
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tieart (tetal) 


A A 

0.0 


93578_NK Cells IL- 
2resting 


0.0 


0.0 


rieart 


A O 

0.3 


AO 1 AA \ /f" J T _ 1_ _j 

93109_Mixed Lymphocyte 
Reaction_Two Way MLR 


6.0 


0.0 


reiai skeletal 


A A 
0.0 


AO 1 1 A V Ml 1 T _ | _j 

93 1 10_Mixed Lymphocyte 
Reaction_Two Way MLR 


0.0 


0.0 


Skeletal muscle 


A A 
0.0 


AO 1 1 1 "V 4" 1 T t . 

93 1 1 l_Mixed Lymphocyte 
Reaction_Two Way MLR 


0.0 


1.3 


Bone marrow 


A A 
0.0 


AO 1 IO > jf - _ 1 ^ /~\ _ 11 _ 

93 1 12_MononucIear Cells 
(PBMCsLresting 


0.0 


18.4 


Thyraus 


A A 
0.0 


93 1 1 3_Mononuclear Cells 
(PBMCs)_PWM 


A n 

0.0 


A A 

0.0 


Spleen 


A A 
0.0 


931 14_Mononuclear Cells 
(PBMCs)_PHA-L 


A A 

0.0 


6.6 


Lymph node 


A A 
0.0 


AO^>lA T> „ ■. . ■ ■-. /T"> 

93249_Ramos (B 
cell)_none 


5.7 


A A 

0.0 


Colorectal 


0.1 


93250_Ramos (B 
cell)_ionomycin 


0.0 


0.0 


Stomach 


0.0 


93349_B 

lymphocytes_PWM 


0.0 


0.0 


Small intestine 


0.0 


93350_B 

lyniphoytes CD40Land 
IL-4 


0.0 


0.0 


Colon ca. 
SW480 


0.0 


92665.EOL-1 

(Eosinophil)_dbcAMP 

differentiated 


0.0 


3.2 


Colon ca.* 

(SW480 

met)SW620 


0.2 


93248 JEOL-1 

(Eosinophil)_dbcAMP/PM 

Aionomycin 


0.0 


0.0 


Colon ca. 
HT29 


A A 

0.0 


93356_Dendntic 
Cells_none 


A /\ 

0.0 


0.0 


Colon ca. 
HCT-11$ 


0.0 


93355JDendritic 
Cells_LPS lOOng/ml 


0.0 


0.0 


Colon ca. 
CaCo-2 


0.0 


93775_Dendritic 
Cells_anti-CD40 


0.0 


0.0 


83219 CC Well 
to Mod Diff 
(OD03866) 


0.0 


93774_Monocytes_resting 


0.0 


0.0 


Colon ca. 
HCC-2998 


1.3 


93776 JrfonocytesJLPS 50 
pg/ml 


0.0 


0.0 


Gastric ca * 
(liver met) NCI- 
N87 


0.2 


93 58 1 _Macrophages_restin 
g 


1.1 


0.0 


Bladder 


0.0 


93582_Macrophages_LPS 
100 ng/ml 


0.0 


5.9 


Trachea 


0.0 . 


93098_HUVEC 
(Endothelial)_none 


0.0 


0.0 
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Kidney 


0.0 


93099_HUVEC 
(Endothelial)_starved 


0.0 


0.0 


Kidney (fetal) 


0.0 


93100 HUVEC 
(Endothelial)_IL-lb 


0.0 


0.0 


Renal ca. 
786-0 


0.0 


93779_HUVEC 
Endothelial)_IFN gamma 


0.0 


0.0 


Renal ca. 
A498 


0.5 


93102_HUVEC 
(Endothelial)_TNF alpha + 
GFN gamma 


0.0 


0.0 


Renal ca. 
RXF393 


0.0 


93101_HUVEC 
(Endothelial) TNF alpha + 
IL4 


0.0 


0.0 


Renal ca. 
ACHN 


0.0 


93781_HUVEC 
(Endothelial)JL-l 1 


0.0 


0.0 


Renal ca. 
UO-31 


0.0 


93583_Lung 

Microvascular Endothelial 
Cells none 


0.0 


0.0 


Renal ca. 
TK-10 


0.0 


93584_Lung 

Microvascular Endothelial 
Cells TNFa (4 ng/ml) and 
ILlb (I ng/ml) 


0.0 


0.0 


Liver 


100.0 


92662_Microvascular 
Dermal endotheliumnone 


0.0 


0.0 


Liver (fetal) 


0.3 


92663_Microsvasular 
Dermal endothelium_TNFa 
(4 ng/ml) and ILlb (1 
ng/ml) 


0.0 


0.0 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


93773_Bronchial 
epithelium TNFa (4 ng/ml) 
and ILlb (1 ng/ml) ** 


0.0 


2.2 


Lung 


. 0.0 


93347JSmall Airway 
EpitheIium_none 


.. o.o 


0.0 


Lung (fetal) 


0.0 


93348_Small Airway 
Epithelium_TNFa (4 
ng/ml) and ILlb (1 ng/ml) 


0.0 


3.7 


Lung ca. (small 
cell) LX-1 


0.0 


92668_Coronery Artery 
SMCresting 


0.0 


0.0 


Lung ca. (small 
cell) NCI-H69 


0.0 


92669_Coronery Artery 
SMC_TNFa (4 ng/ml) arid 


0.0 


0.0 


Lung ca. (s.cell 
van) SHP-77 


0.0 


93 1 07_astrocytes_resting 


0.0 


3.4 


Lung ca. (large 
cell)NCI-H460 


0.0 


93 1 08_astrocytes_TNFa (4 
ng/ml) and ILlb (1 ng/ml) 


0.0 


0.0 


Lung ca. (non- 
sin, cell) A549 


0.0 


92666_KU-812 . 
(Basophil)_resting 


0.0 


1.6 


Lung ca. (non- 
s.cell) NCI-H23 


0.6 


92667_KU-812 
(Basophil)_PMA/ionoycin 


0.0 


0.0 
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Lung ca (non- 
s.cell) HOP-62 


0.0 


93579 CCD1106 
(Keratinocytes)_none 


0.0 


0.0 


Lung ca. (non- 
s.cl) NCI-H522 


0.0 


93580_CCD1106 
(Keratinocytes) TNFaand 
IFNg ** 


0.0 


0.0 


Lung ca. 
(squam.) SW 
900 


0.5 


93791_Livcr Cirrhosis 


100.0 


100.0 


Lung ca. 
(squam.) NCI- 
H596 


0.0 


93792_Lupus Kidney 


0.0 


0.0 


Mammary gland 


0.0 


93577_NCI-H292 


0.0 


0.0 


Breast ca.* (pi. 
effusion) MCF- 
7 


0.0 


93358_NCI-H292_IL-4 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.0 


93360_NCI-H292_IL-9 


0.0 


0.0 


Breast ca.* (pi. 

effusion) 

T47D 


0.0 


93359_NCI-H292_IL-1 3 


0.0 


0.0 


Jut I tj/LtOl WU. 

BT-549 


0 0 


gamma 




n fl 


Rrea^t ca 

MDA-N 


0 0 

V.V 


0X111 TTPAFP - 




3 A 
Or* 


Ovary 


0.0 


9377.8_HPAEC_IL-1 
beta/ 1 N A alpha 


0.0 


0.0 


Ovarian ca. 

cwir* AD 1 
UVLAK-3 ■ 


0.0 


93254_Normal Human 
Lung Fibroblast_none 


3.4 


o.p 


Ovarian ca. 


0.0 


93253_Normal Human 
Lung riDroDlast^irNra (4 
ng/ml) and IL-lb (1 ng/ml) 


0.0 


0.0 


Ovarian ca 
OVCAR-5 


0 0 

v.v 


Q39^7 Mrirmal Human 
I i^i Hi ilia 1 X 1U ill all 

Lung Fibr6blast_IL-4 


• 0 0 




Ovarian ca. 

r\\ at) o 


0.0 


93256_Normal Human 
Lung ribroblast_IL-9 


0.0 


0.0 


Ovarian ca. 

luKUV-J 


0.0 


93255_Normal Human 
Lung FibroDlastJL-13 


0.0 


0.0 


Ovarian ca.* 
(ascites) SK- 
OV-3 


0.0 


93258_Normal Human 
Lung Fibroblast JFN 
gamma 


0.0 


0.0 


Uterus 


0.5 


93106_Dernial Fibroblasts 
CCD1070_resting 


0.0 


0.0 


Plancenta 


0.0 


93361 Dermal Fibroblasts 

CCD1070JINFalpha4 

ng/ml 


0.0 


0.0 
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Prostate 


0.0 


93105 Dermal Fibroblasts 
CCD1070JL-1 beta 1 
ng/ml 


0.0 


4.3 


Prostate ca * 
(bone met)PC-3 


0.0 


93772_dermal 
fibroblast_IFN gamma 


0.0 


7.9 


Testis 


0.4 


93771_dermal 
fibroblast_IL-4 


0.0 


0.0 


Melanoma 
Hs688(A).T 


0.0 


93259 JBD Colitis 1** 


6.0 


13.1 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


93260 JBD Colitis 2 


2.8 


5.2 


Melanoma 
UACC-62 


0.3 


93261_IBDCrohns 


0.0 


1 ft 

1.9 


Melanoma 
M14 


0.1 


73501 0_Colon_normal 


1.3 


3.2 


Melanoma 
LOX IMVT 


0.0 


735019_Lung_none 


14.9 


0.0 


Melanoma* 
(met) SK- 
MEL-5 


0.4 


64028- l_Thymus_none 


2.3 


7.3 


Adipose 


0.0 


64030- l_Kidney_none 


0.0 


0.0 



Both probe/primer sets are specific for the sequence of gene Acc. No. MOL3. Unigene 
data at http://www.ncbi.nlm.iiih.gov/UmGene/clustcgi7d 

and our RTQ-PCR panels 1 .3D,2D and.4D indicate that this gene is specifically 
expressed by the liver and upregulated in some hepatocellular carcinomas (HCCs).There is 
evidence suggesting that the examination of the serum complements may be a useful tool for the 
detection of HCCs in liver cirrosis (LC) patients. (Takezaki E; Murakami S, Nishibayashi H s 
Kagawa K, Ohmori H. Gan No Rinsho 1990 Oct;36(12):21 19-22). Therefore the serum level of 
this protein can be used as a diagnostic marker to detect LC and HCC and antibodies directed 
against this protein can be a potential therapeutic agent against LC and HCC. In addition, this 
molecule can also serve as a specific marker for differentiating liver from other tissues. 



MOL4 

Expression of gene MOL4 was assessed using the primer-probe set Agl61 1, described in 
Table 21 

171 ... 
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Table 21. Probe Name: Agl611 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 
NO: 


Forward 


5'-ATATGCTGTGCTGCATTCAGT-3' 


58.4 


21 


40 


85 


Probe 


TET-5'- 

CTGCCTGGTCAGTGAACAATTTCCTG- 
3-TAMRA 


68.9 


26 


65 


86 


Reverse 


5'-CAAGGCCACACTAGTCGTGTAG-3' 


59.9 


22 


117 


87 



Expression of this gene in panels 1 .3D, AD and 2 is at very low to undetectable levels (Ct 
5 values>35) in a number of tissues. 

MOL6 

Expression of gene MOL6a was assessed using the primer-probe set Agl 167, described 
in Table 22. Results of the RTQ-PCR runs are shown in Tables 23 and 24. 
10 Table 22. Probe Name: Agll67 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 
NO: 


Forward 


5'-TCTTTGCTGACTCATCTGTTCA-3' 


58.7 


22 


51 


88 


Probe 


TET-5'- 

AAGAAGACCCTGCTCCCTATTTGGTG- 
3'-TAMRA 


67.5 


26 . 


75 


89 


Reverse 


5'-AGGGGTTGAAGTGAGACTTGAG-3 ' 


59.8 


22 


104 


90 



Table 23. Panels 1.3D and 4D 



PANEL 1.3D 




PANEL 4D 




Tissue Name 


Rel. Expr., 

% 

1.3dx4tm55 
86t agl 167 
al 


Tissue Name 


Rel. Expr., 

% 

4Dtml937t 
_agll67 


Liver 

adenocarcinoma 


1.1 


93768 Secondary Thl anti-CD28/anti- 
CD3 


1.7 


Pancreas 


0.0 


93769 Secondary Th2 anti-CD28/anti- 
CD3 


0.0 


Pancreatic ca. 
CAP AN 2 


1.0 


93770 Secondary Trl anti-CD28/anti- 
CD3 ~ 


0.9 


Adrenal gland 


0.0 


93573 Secondary Thl resting day 4-6 
inIL-2 


2.7 
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Thyroid 


0.0 


93572_Secondary Th2_resting day 4-6 
inIL-2 


0.0 


Salivary gland 


0.0 


93571_Secondary Trl_resting day 4-6 ' 
inIL-2 


0.0 


Pituitary gland 


2.1 


93568_pnmary Thl anti-CD28/anti- 
CD3 


0.0 


rJram (tetal) 


A 1 

4.1 


935oy__pnrnary ixil anu-CJJzo/anti- 
CD3 




Brain (whole) 


1.1 


9357U_pnmary irl anti-ulJ2s/anti- 
CD3 


LA 


Brain (amygdala) 


A 1 

4.1 


93565_pnmary In ] resting dy 4-6 in 
IL-2 




Brain (cerebellum) 


6.7 


93566_pnmary Th2 resting dy 4-6 in 
IL-2 


3.7 


Brain (hippocampus) 


4.3 


93567_pnmary Trl_resting dy 4-6 in 
IL-2 


3.7 


Brain (substantia 
nigra) 


5.4 


93 3 5 1_CD45RA CD4 
ymphocyte_anti-CD28/anti-CD3 


1.0 


Brain (thalamus) 


4.7 


93352_CD45RO CD4 
lymphocyte_anti-CD28/anti-CD3 


2.6 


Cerebral Cortex 


1.3 


93251 CDS Lymphocytes anti- 
CD28/anti-CD3 


0.9 


Spinal cord 


1.6 


93353_chronic CD8 Lymphocytes 
2ry_resting dy 4-6 in IL-2 


0.4 


CNS ca, (glio/astro) 
U87-MG 


. 0.0 


93574_chronic CDS Lymphocytes 
2ry_activated CD3/CD28 


0.9 


uino ca. yjno/ astro j 
U-118-MG. 


\),\) 


7jj j*f_L/jj*t_none 




v^in o ca. ^asiro ) 
SW1783 


U.u 


yjZDZ oeconoary lni/xnz/iri anii- 
CD95 CH11 




CNS ca.* (neuro; met 
)SK-N-AS 


0.0 


93 103_LAK cells j-estihg . 


7.0 


CNS ca. (astro) 
SF-539 


0.0 


93788JLAK cells_IL-2 , 


0.9 


CNS ca. (astro) • 
SNB-75 


2.4 


93787_LAK rells_IL-2+IL-12 


1.6 


PTSJC n*x (n\\rC\ 
^INo Ctt. VS"OJ 

SNB-19 


A 1 
H.J 


07720 T A V nolle TT _0-4-TfTM rrommo 

7$ 1 oy__i>/\jfk. ce]is_jj-r^T^irJN gamma 




CNS ca. (glio) 
U251 


1.3 


93790 JLAK cells JL-2+ EL- 18 


1.7 


CNS ca. (glio) 
SF-295 


0.0 


93 1 04_JLAk cells_PMA/ionomycin 
andIL-18 


0.0 


Heart (fetal) 


0.0 


93578_NK Cells IL-2_resting 


2.2 


Heart 


0.0 


93 109_Mixed Lymphocyte 
Reaction_Two Way MLR 


2.4 


Fetal Skeletal 


0.0 


93 1 1 0_Mixed Lymphocyte 
Reaction_Two Way MLR 


1.6 . 
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Skeletal muscle 


1.0 


931 1 1 Mixed Lymphocyte 
Reaction JTwo Way MLR 


0.0 


Bone marrow 


0.0 


93 1 1 2_Mononuclear Cells 
(PBMCs) resting 


0.0 


Thymus 


0.0 


93 1 1 3_Mononuclear Cells 
(PBMCs)_PWM 


6.0 


Spleen 


16.0 


93 1 1 4_Mononuclear Cells 
(PBMCs)_PHA-L 


0.9 


Lymph node 


1.4 


93249_Ramos (B cell)_none 


6.8 


Colorectal 


0.0 


93250_Ramos (B ceIl)_ionomycin 


5.9 


Stomach 


0.0 


93349_B lymphocytes^PWM 


3.8 


Small intestine 


2.9 


93350J3 lymphoytes_CD40L and IL-4 


2.4 


Colon ca. 
SW480 


0.0 


92665_EOL-l (EosinophilX_dbcAMP 
differentiated 


82.9 


Colon ca * (SW4S0 
met)SW620 


1.7 


93248_EOL-l 

(Eosinophil)_dbcAMP/PMAionomycin 


100.0 


Colon ca. 
HT29 


0.0 


93356_Dendritic Cells_none 


2.5 


Colon ca. 
HCT-116 


3.9 


93355_Dendritic Cells_LPS 100 ng/ml 


0.9 


Colon ca. 
CaCo-2 


1.0 


93775_Dendritic Cells_anti-CD40 


2.5 


83219 CC Well to 
Mod Diff(OD03866) 


0.0 


93774_Monocytes_resting 


0.0 


Colon ca. 
HCC-2998 


1.9 


93776 JrfonocytesJLPS 50 ng/ml 


0.8 


Gastric ca.* (liver 
met)NCI-N87 


2.3 


9358 l_Macrophages_resting 


0.8 


Bladder 


4.9 


93582_Macrophages_LPS 100 ng/ml 


0.4 


Trachea 


0.0 


93098_HUVEC (Endothelial)_none 


0.9 


rwiuncy 


T ft 


y j\)7 j_ou v jla^ ^linaouieiiai j_siarveG 


1 C 

l.J 


Kidney (fetal) 


2.3 


93100_HUVEC (Endothelial)_IL-lb 


0.0 


Renal ca. . 
786-0 


0.0 


93779_HUVEC (Endothelial)JFN 
gamma 


2.4 


Renal ca. 
A498 


0.9 


93102_HUVEC (Endothelial)_TNF 
alpha + IFN gamma 


0.0 


Renal ca. 
RXF393 


0.0 


93101_HUVEC (Endothelial)_TNF 
alpha + IL4 


0.0 


Renal ca. 
ACHN 


1.5 


93781_HUVEC (Endothelial)_IL-l 1 


. 0.9. 
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Renal ca. 

I TO-'* 1 


0.0 


93583_Lung Microvascular Endothelial 


2.1 


Renal ca. 
TK-10 


4.0 


93584 Lung Microvascular Endothelial 
CellsJTNFa (4 ng/ml) and ILlb (1 
ng/ml) 


1.3 


Liver 


0.0 


92662 ^Microvascular Dermal 
endothelium none 


1.2 


Liver (fetal) 


0.0 


92663_Microsvasular Dermal 
endothelium TNFa (4 ng/ml) and ILlb 
(1 ng/ml) 


1.5 


Liver ca. 

(hepatoblast) HepG2 


0.5 


93773_Bronchial epithelium_TNFa (4 
ng/ml) and ILlb (1 ng/ml) ** 


44.1 


Lung 


0.0 


93347_Small Airway Epithelium_none 


5.6 


Lung (fetal) 


2.2 


93348_Small Airway 

Epithelium TNFa (4 ng/ml) and ILlb 

(1 ng/ml) 


25.7 


Lung ca. (small cell) 
LX-1 


5.2 


92668_Coronery Artery SMCjresting 


0.7 


Lung ca. (small cell) 
NCI-H69 


2.8 


92669_Coronery Artery SMC JTNFa (4 
ng/ml) and ILlb (1 ng/ml) 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


0.8 


93 1 07_astrocytcs_resting 


3.5 


Lung ca. (large 
cell)NCI-H460 


2.0 


93 108 astrocytes TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


1.4 


Lung ca. (non-sin. . . 
cell)A549 


1.8 


92666_KXJ-812 (Basophil)_resting 


20.4 


Lung ca. (non-s.cell) 
NCI-H23 


4.7 


92667_KU-812. 
(Basophil)_PMA/ionoycin 


34.2 


Lung ca (non-s.cell) 
HOP-62 


1.3 


93579_CCD1 106 (keratinocytes)__none 


0.7 


Lung ca. (non-sxl) 
NCI-H522 


0.0 


93580_CCD1106 
(Jveraanocytesj_ii\ra ana ir fNg 


in i 


Lung ca. (squam.) 

flMT AAA 

SW 900 


0.0 


93791_Liver Cirrhosis 


3.2 


Lung ca. (squam.) 
NCI-H596 


2.2 


93792_Lupus Kidney 


5.2 


Mammary gland 


0.8 


93577 NCI-H292 


6.3 


Breast ca.* (pi. 
effusion) MCF-7 


3.8 


93358_NCI-H292_IL-4 


7.4 


Breast ca.* (pLef) 
MDA-MB-231 


0.0 




O.J 


Breast ca.* (pi. 
effusion) T47D 


0.0 


93359_NCI-H292_IL-13 


8.8 


Breast ca. 
BT-549 


0.0 


93357_NCI-H292_IFN gamma 


4, 
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Breast ca. 
MDA-N 


0.0 


93777_HPAEC_- 


0.9 


Ovary 


0.0 


93778_HPAECJL-1 beta/TNA alpha 


1.4 


Ovarian ca. 
OVCAR-3 


0.0 


93254_Normal Human Lung 
Fibroblast none 


0.0 


Ovarian ca. 
OVCAR-4 


1.6 


93253_Normal Human Lung 
Fibroblast TNFa (4 ng/ml) and IL-lb 
(1 ng/ml) 


0.0 


Ovarian ca. 
OVCAR-5 


1.9 


93257_Normal Human Lung 
Fibroblast IL-4 


1.5 


Ovarian ca. 
OVCAR-8 


2.8 


93256_Normal Human Lung 
Fibroblast IL-9 


0.0 


Ovarian ca. 
IGROV-1 


2.7 


93255_Normal Human Lung 
Fibroblast IL-13 


1.9 


Ovarian ca.* (ascites) 
SK-OV-3 


5.6 


93258_Normal Human Lung 
FibroblastJFN gamma 


2.2 


Uterus 


0.6 


93106_Dcrmal Fibroblasts 
CCD1070_resting 


0.8 


Plancenta 


1.0 


93361_Dermal Fibroblasts 
CCD1070JTNF alpha 4 ng/ml 


1.6 


Prostate 


0.0 


93105 Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/ml 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


93772_dermal flbroblastJFN gamma 


2.0 


Testis 


100.0 


93771 Jermal fibroblast JL^ 


0.9. 


Melanoma 
IIs688(A).T 


0.0 


93259 JBD Colitis 1** 


21.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


93260 JBD Colitis 2 


1.3 


Melanoma 
UACC-62 


0.0 


93261JBD Crohns 


0-.7 


Melanoma 

1V1 1 *T 


0.9 


735010_Colon_normal 


0.7. 


Melanoma 
LOXIMVI 


0.0 


735019_Lung u none 


2.2 


Melanoma* (met) 
SK-MEL-5 


0.0 


64028-l_Thymus_none 


41.8 


Adipose 


i.9 


64030-l_Kidney_none 


6.0 



Table 24. Panels 2D and 3D 



PANEL 2D 




PANEL 3D 




Tissue Name 


Rel. Expr., 
% 


Tissue Name 


Rel. Expr., 

% 
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2Dtm2324t_ 
agll67 




3dtm5309t 
_agll67 


Normal Colon 
GENPAK 061003 


2.9 


94905_Daoy_Medulloblastoma/Cerebell 
uni sscDNA 


4.0 


83219 CC Well to 
Mod DifT(OD03866) 


2.5 


94906_TE671_Medulloblastom/Cerebel 
lum sscDNA 


0.0 


83220 CC NAT 
(OD03866) 


18.9 


94907_D283 

Med Medulloblastoma/CerebelluTn ssc 
DNA 


8.4 


83221 CC Gr.2 

rectosigmoid 

(OD03868) 


2.2 


94908 J>FSK-l_Prinutive 
Neuroectodermal/Cerebellum_sscDNA 


7.0 


83222 CC NAT 
(OD03868) 


0.0 


94909 JOM98_CNS_sscDNA 


0.0 


83235 CC Mod Diff ' 
(ODO3920) 


6.3 


94910_SNB-7S_CNS/glioma_sscDNA 


10.2 


83236 CC NAT 
(ODO3920) 


12.6 


9491 1_SF- 

268„CNS/glioblastoma_sscDNA 


0.0 


83237 CC Gr.2 
ascend colon 
(OD03921) 


0.0 


94912_T98G_Glioblastoma_sscDNA 


0.0 


83238 CC NAT 
(OD03921) 


7.7 


96776 JSK-N-SH_Neuroblastoma 
(metastasis)_sscDNA 


0.0 


83241 CC from 
Partial Hepatectomy 
(ODO4309) 


2.9 


94913JSF- 

295_CNS/glioblastoma_sscDNA 


0.0 


83242 Liver NAT 
(ODO4309) 


0.0 


949 14_Cerebelluin_6ScDNA 


66.0 


87472 Colon mets to 
lung (OD04451-01) 


5.0 


96777_Cerebellum_sscDNA 


4.4 


87473 Lung NAT 
(OD04451-02) 


9.9 


94916_NCI-H292_Mucocpidennoid 
lung carcinoma_sscDNA 


7.3 


Normal Prostate 
Clontech A+ 6546- 
1 


17.4 


94917_DMS-114_Srnall cell lung 
cancer_sscDNA 


0.0 


84140 Prostate 
Cancer (OD04410) 


. 2.7 


94918JDMS-79_Small cell lung 
cancer/neuroendocrine sscDNA 


100.0 


84141 Prostate NAT 
(OD04410) 


26.2 


94919_NCI-H146_Small cell lung 
cancer/neuroendocrine sscDNA 


16.4 


87073 Prostate 
Cancer (OD04720- 
01) 


20.6 


94920_NCI-H526_Small cell lung 
cancer/neuroendocrine_sscDNA 


2.1 


87074 Prostate NAT 

{C\T\C\A10C\ (\*)\ 
\\JLs\JH /i,v!"vZ ) 


13.9 


94921_NCI-N417_Small cell lung 
cancer/ ncuroenuut/ime oaCL/iN/\ 


0.0 


Normal Lung 
GENPAK 061010 


17.7 


94923_NCI-H82_Small cell lung 
cancer/neuroendocrine sscDNA 


3.5 


83239 Lung Met to 
Muscle (0DO4286) 


. 5.0 


94924 1 NCI-H157_Squamous cell lung 
cancer (metastasis)_sscDNA 


0.0 


83240 Muscle NAT 


2.6 


94925_NCI-H1 1 55_Large cell lung 


0.0 
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(OD04286) 




cancer/neuroendocrine sscDNA 




84136 Lung 
Malignant Cancer 
(OD03126) 


4.9 


94926_NCI-H1299_Large cell lung 
cancer/ncurocndocrinc_sscDNA 


11.3 


84 137 Lung NAT 
(OD03126) 


0.0 


94927_NCI-H727_Lung 
carcinoid sscDNA 


3.0 


S4871 Lung Cancer 
(OD04404) 


5.5 


9492S_NCI-UMC-1 lJLung 
carcinoid sscDNA 


10.1 


84872 Lung NAT 
(OD04404) 


2.4 


94929_LX-l_Small cell lung 
cancer sscDNA 


0.0 


84875 Lung Cancer 
(OD04565) 


2.5 


94930_Colo-205_Colon 
cancer sscDNA 


0.0 


84876 Lung NAT 
(OD04565) 


7.7 


94931_KM12_Colon cancer_sscDNA 


4.9 


85950 Lung Cancer 
(OD04237-01) 


18.4 


94932_KM20L2_Colon 
cancer sscDNA 


0.0 


85970 Lung NAT 
(OD04237-02) 


7.8 


94933_NCI-H71 6_Colon 
cancer sscDNA 


0.0 


83255 Ocular Mel 
Met to Liver 
(ODO4310) 


0.0 


94935_SW-48_Colon 
adenocarcinoma_sscDNA 


0.0 


83256 Liver NAT 
(ODO4310) 


0.0 


94936_SW1116_Colon 
adenocarcinoma sscDNA 


1.6 


84139 Melanoma 
Mets to Lung 
(OD04321) 


3.0 


94937_LS 174T_Colon 
adenocarcinoma_sscDNA 


0.0 


84138 Lung NAT 
(OD04321) 


2.6 


94938_SW-948_Colon 
adenocarcinoma sscDNA 


4.4 ' 


Normal Kidney 
GENPAK 061008 


100.0 . 


94939_SW-480_Colon 
adenocarcinoma sscDNA 


0.0 


83786 Kidney Ca, 
Nuclear grade 2 - 
(OD04338) 


56.6 


94940_NCI-SNU-5_Gastric 
carcinoma_sscDNA 


0.0 


83787 Kidney NAT 
(OD04338) . 


23.2 


94941_KATO m_Gastric 
carcinoma sscDNA 


2.6 


83788 Kidney Ca 
Nuclear grade 1/2 
(OD04339) 


0.0 


94943_NCI-SNU-16_Gastric 
carcinoma_sscDNA 


3.6 


83789 Kidney NAT 
(OD04339) 


46.0 


94944_NCI-SNU-l_Gastric 
carcinoma sscDNA 


0.0 


83790 Kidney Ca, 
Clear cell type 
(OD04340) 


12.9 


94946_RF-l_Gastric 
adenocarcinoma_sscDNA 


0.0 


83791 Kidney NAT 
(OD04340) 


24.5 


94947_RF-48_Gastric 
adenocarcinoma sscDNA 


0.0 


83792 Kidney Ca, 
Nuclear grade 3 
(OD04348) 


4.5 


96778_MKN-45_Gastric 
carcinoma_sscDNA 


0.0 



178 



BNSDOCIO <WO_0iei57BA2J_> 



WO 0I/8157S PCT/US01/1357X 



83793 Kidney NAT 
fOD04348i 


AAA 


94949_NCI-N87j3astric 
carcinoma sscDNA 


0.0 


87474 Kidney Cancer 
(OD04622-0H 


2.8 


S>495 1_0 VCAR-5_Ovarian 
:arcinoma sscDNA 


0.0 


87475 Kidney NAT 
(OD04622-03) 


0.0 


94952_RL95-2_Uterine 
sarcinoma sscDNA 


0.0 


85973 Kidney Cancer 
(OD04450-01) 


4.4 


94953_HelaS3_Cervical 
adenocarcinoma sscDNA 


0.0 


85974 Kidney NAT 
(OD04450-03) 


24.8 


94954_Ca Ski_Cervical epidermoid 
carcinoma (metastasis) sscDNA 


5.8 


Kidney Cancer 
Clontech 8120607 


0.0 


94955_ES-2_Ovarian clear cell 
carcinoma sscDNA 


0.0 


Kidney NAT 
Clontech 8120608 


0.0 


94957_Ramos/6h stim_"; Stimulated 
with PMA/ionomycin 6h sscDNA 


0.0 


Kidney Cancer 
Clontech 8120613 


0.0 


94958_Ramos/14h stimj'; Stimulated 
with PMA/ionomycin 14h_sscDNA 


0.0 


Kidney NAT 
Clontech 8120614 


0.0 


94962_MEG-01_Chronic myelogenous 
leukemia (megokaryoblast)_sscDNA 


28.3 


Kidney Cancer 
Clontech 9010320 


2.6 


94963_Raji_Burkitt , s 
ymphoma sscDNA 


0.0 


Kidney NAT 
Clontech 9010321 


8.4 


94964_Daudi_Burkitt , s 
lymphoma sscDNA 


2.9 


MnTTTml TTt#*nic 
INUiXJlal UICIUd 

GENPAK061018 


0 0 


94965 U266 B-cell 
plasmacytoma/myeloma_sscDNA 


6.4 


TTtf*nic Panrpr 

GENPAK064011 


16.2 


94968 CA46 Burkitt's 
lympfaoma_sscDNA 


0.0 


Normal Thyroid 
Clontech A4- 6570- 

1 


1.8 


94970 JRL.non-Ilodgkin's B-cell 
lymphoma sscDNA 


0.0 


Thyroid Cancer 


27.2 


94972 JMl_pre-B-cell 

1 vrrmhoma/l eukemia sscDNA 


6.7 


Thyroid Cancer 
TNVITROT5FN 
A302152 


5.2 


94973_Jurkat_T cell leukemia_sscDNA 


4.2 


Thvroid NAT 

INVITROGEN 

A302153 


19 9 


94974 TF-1 Erythroleukemia sscDNA 


2.4 


Normal Breast 
GENPAK 061019 


47.3 


94975_HUT78_T-cell 
lymphoma sscDNA 


2.8 


84877 Breast Cancer 
(OD04566) 


7.0 


94977_U937_Histiocytic 
lymphoma sscDNA 


6.6 


R^07S Rrpact PflnrPT 

(OD04590-01) 


1 *> 7 


04980 KII-S12 Mveloeenous 
leukemia sscDNA 


18.2 


85976 Breast Cancer 
Mets (OD04590-03) 


17.7 


9498 1_769-P_C1 ear cell renal 
carcinoma sscDNA 


2.6 


87070 Breast Cancer 

Metastasis 

(OD04655-05) 


76.8 


94983_Caki-2_Clear cell renal 
carcinoma_sscDNA 


0.0 


GENPAK Breast 


42.6 


949S4_SW 839_Clear cell renal 

^ r\xT a 


1.9 
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Cancer 064006 




carcinoma_sscDNA 




Breast Cancer Res. 
Gen. 1024 


47.6 


94986_G401_Wilms* tumor_sscDNA 


0.0 


Breast Cancer 
Clontech 9100266 


2.6 


94987_Hs766T_Pancreatic carcinoma 
(LN metastasis)_sscDNA 


0.0 


Breast NAT Clontech 
9100265 


5.1 


94988_CAPAN-l_Pancreatic 
adenocarcinoma (liver 
metastasis)_sscDNA 


0.0 


Breast Cancer 

1NV1TROGEN 

A209073 


10.7 


94989_SU86.86_Pancreatic carcinoma 
(liver metastasis)_sscDNA 


3.2 


Breast NAT 

INVITROGEN 

A2090734 


12.9 


94990_BxPC-3_Pancreatic 
adenocarcinoma_sscDNA 


0.0 


Normal Liver 
GENPAK 061009 


13.1 


94991_HPAC_Pancreatic 
adenocarcinoma sscDNA 


3.6 


Liver Cancer 
GENPAK 064003 


4.8 


94992_M1A PaCa-2_Pancreatic 
carcinoma sscDNA 


0.0 


Liver Cancer 
Research Genetics 
RNA 1025 


2.5 


94993_CFPAC-l_Pancrearic ductal 
adenocarcinoma_sscDNA 


13.6 


Liver Cancer 
Research Genetics 
RNA 1026 


0.0 


94994_PANC-l_Pancreatic epithelioid 
ductal carcinoma_sscDNA 


0.0 


Paired Liver Cancer 
Tissue Research 
Genetics RNA 6004- 
T 




94996_T24_Bladder carcinma 
(transitional cell)_sscDNA 


3.1 


Paired Liver Tissue 
Research Genetics 
RNA 6004-N 


2.5 


94997,_5637_Bladder 
carcinoma_sscDNA 


0.0 


Paired Liver Cancer 
Tissue Research 
Genetics RNA 6005- 
T 


0.0 


94998JIT-1 19?_Bladder 
carcinoma_sscDNA 


20.2 


Paired Liver Tissue 
Research Genetics 
RNA 6005-N 


0.0 


94999_UM-UC-3_Bladder carcinoma 
[transitional cell)_sscDNA 


0.0 


Normal Bladder 
GENPAK 061001 


3.1 


95000 A204 Rhabdomyosarcoma ssc 
DNA 


3.3 


Bladder Cancer 
Research Genetics 
RNA 1023 


0.0 


95001_HT- 

1 080_Fibrosarcoma_sscDNA 


5.2 


Bladder Cancer 
INVITROGEN 
A302173 


16.7 


95002_MG-63_Osteosarcoma 
T)one)_sscDNA 


0.0 


87071 Bladder 
Cancer (OD047 18- 
01) 


0.0 


95003_SK-LMS-l_Leiomyosarcoma 
vulva)_sscDNA 


7.9 
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87072 Bladder 
Normal Adjacent 
(OD047 18-03) 


2.9 


95004 SJRH30 Rhabdomyosarcoma 
(met to bone marrow)_sscDNA 


0.0 


Nonnal Ovary Res. 
Gen. 


0.0 


95005_A43 l_Epidermoid 
carcinoma sscDNA 


0.0 


Ovarian Cancer 
GENPAK 064008 


12.6 


95007_WM266-4__Mclanoma_sscDNA 


0.0 


87492 Ovary Cancer 
(OD04768-07) 


2.9 


95010JDU 145_Prostate carcinoma 
(brain metastasis) sscDNA 


0.0 


87493 Ovary NAT 
(OD04768-08) 


0.0 


95012_MDA-MB-468_Breast 
adenocarcinoma sscDNA 


0.0 


Normal Stomach 
GENPAK 061017 


5.9 


95013_SCC-4_Squamous cell 
carcinoma of tongue sscDNA 


0.0 


Gastric Cancer 
Clontech 9060358 


0.0 


95014_SCC-9_Squamous cell 
carcinoma of tongue sscDNA 


0.0 


NAT Stomach 
Clontech 9060359 


0.0 


95015_SCC-15_Squamous cell 
carcinoma of tongue sscDNA 


0.0 


Gastric Cancer 
Clontech 9060395 


2.9 


95017_CAL 27_Squamous cell 
carcinoma of tongue sscDNA 


0.0 


NAT Stomach 
Clontech 9060394 


0.0 






Gastric Cancer 
Clontech 9060397 


0.0 






NAT Stomach 
Clontech 9060396 


0.0 






Gastric Cancer 
GENPAK 064005 


10.5 







Expression of gene MOL6 in panel 1.3D is detected in the testes and at very low levels in 
the spleen, but not in any other tissues. Expression in panel 2D indicates higher expression in 
5 normal kidney and markedly lower expression in kidney cancer. This indicates a potential role 
for this gene as a protein therapeutic in cases of kidney cancer. In panel 3D, expression is seen to 
be highest in a specimen of small cell lung cancer, chronic myelogenous leukemia and bladder 
cancer. 

Expression of gene MOL6 in Panel 4D is up-regulaled in keratinocytes and small airway 
10 epithelium after treatment with TNF alpha and EL-1 beta. It is also upregulated in eosinophils 
regardless of treatment and in normal thymus. 

Potential Role(s) of MOL6 in Inflammation: The expression partem of GM_87760758_A 
shows that it is induced in keratinocytes and small airway epithelium in response to pro- 
inflammatory cytokines. Thus the protein in question may contribute to tissue destruction in the 
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airways, recruitment of leukocytes, and tissue remodeling (Reichart et al., 1996 J. Pathol. 178 
(2): 2 15-20). 

Impact of Therapeutic Targeting of MOL6: Antibodies or small molecule therapeutics to 
MOL6 may reduce or inhibit tissue damage due to inflammation in psoriasis, asthma and other 
5 mast cell-mediated diseases both in the skin and in the airways. The results are also suggestive 
of a potential role for MOL6 in the treatment for emphysema (Rice et al., 1998 Curr Pharm Des 
4(5): 381-96). 

MOL7 

10 Expression of gene MOL7 was assessed using the primer-probe sets Agl 876, described 

in Table 25. Results of the RTQ-PCRnins are shown in Table 26. 



Table 25. Probe name: Agl876 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 

Positio 

n 


SEQ 
ID NO: 


Forward 


5'-AGCAAGATTGCTCACACAGAGT-3' 


59.2 


22 


668 


91 


Probe 


TET-5'- . 

CCAGTCAATACCATCATCATCCATGAGG- 
3-TAMRA 


69.1 


28 . 


692 


92 


Reverse 


5'-TATGTTGTTGCTCATGGAGTTG-3' 


58.7 


22 


730 


93 



15 

Table 26. Panel 1.3D 



Tissue Name 


Rcl.Expr.,% 
1.3dx4tm5422t 
agl876_al 


Tissue Name 


Rel. Expr., % 
13dx4tm5422t 
_agl876_al 


Liver 

adenocarcinoma 


0.0 


Kidney (fetal) 


0.0 


Pancreas 


0.0 


Renal ca. 


786-0 


0.0 


Pancreatic ca. 
CAP AN 2 


0.4 


Renal ca. 


A498 


0.5 


Adrenal gland 


0.7 


Renal ca. 


RXF393 


0.0 


Thyroid 


0.0 


Renal ca. 


ACHN 


0.6 


Salivary gland 


0.4 


Renal ca. 


UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca: 


TK-10 


0.4 


Brain (fetal) 


03 


Liver 


0.0 
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Brain (whole) 


4.2 


Liver (fetal) 


0.0 


Drain ^arnyguaidj 


*> < 




0 0 


Brain (cerebellum) 


0.8 


Lung 


0.0 


Brain (hippocampus) 


2.7 


Lung (fetal) 


0.0 


Brain (substantia 
nigra) 


1.4 


Lung ca. (small cell) LX- 1 


1.3 


Brain (thalamus) 


1.6 


Lung ca. (small cell) NCI- 
H69 


0.0 


Cerebral Cortex 


0.6 


Lung ca. (s.cell var.) SHP-77 


0.0 


Spinal cord 


0.4 


Lung ca. (large cell)NCI-H460 


0.4 


CNS ca. (glio/astro) 
U87-MG 


0.0 


Lung ca. (non-sm. cell) A549 


0.5 


CNSca. (glio/astro) 
U-118-MG 


0.2 


Lung ca. (non-s.cell) NCI-H23 


0.0 


CNS ca. (astro) 
SW1783 


0.0 


Lung ca (non-s.cell) HOP-62 


0.0 


CNS ca.* (neuro; met 
) SK-N-AS 


0.7 


Lung ca. (non-sxl) NCI-H522 


0.3 


CNS ca. (astro) 
SF-539 


0.0 


Lung ca. (squam.) SW 900 


0.8 


CNS ca. (astro) 
SNB-75 


0.0 


Lung ca. (squam.) NCI-H596 


0.0 


CNS ca. (glio) 
SNB-19 


.2.2 


Mammary gland 


0.0 


CNS ca. (glio) 
U251 


1.3 


Breast ca.* (pi. effusion) 
MCF-7 


2.9 


CNS ca. (glio) 
SF-295 


0.4 


Breast ca * (pl.ef) MDA-MB- 
231 


0.0 


Heart (fetal) 


0.0 


Breast ca* (pi. effusion) 
T47D 


0.0 


Heart 


0.0 


Breast ca. BT- 


0.6 


Fetal Skeletal 


0.0 


Breast ca. MDA- 

KT 
IN 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


0.0 


Ovarian ca. OVCAR- 
3 


0.0 


Thymus 


1.0 


Ovarian ca. OVCAR- 
4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR- 
5 


0.3 


Lymph node 


0.5 


Ovarian ca. OVCAR- 
8 


0.8 


Colorectal 


0.0 


Ovarian ca. IGROV- 
1 


0.0 
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Stomach 


0.4 


Ovarian ca.* (ascites) SK-OV- 
3 


0.6 


Small intestine 


0.7 


Uterus 


0.0 


Colon ca. 
SW480 


0.0 


Plancenta 


5.5 


Colon ca.* (SW480 
met)SW620 


0.6 


Prostate 


0.0 


Colon ca. 
HT29 


0.0 


Prostate ca.* (bone met)PC-3 


0.0 


Colon ca. 
HCT-116 


0.3 


Testis 


100.0 


Colon ca. 
CaCo-2 


0.0 


Melanoma Hs688(A).T 


0.0 


83219 CC Well to 
Mod Diff (OD03866) 


0.0 


Melanoma* (met) Hs688(B).T 


0.0 


Colon ca. 
HCC-2998 


0.2 


Melanoma UACC-62 


I A 


Gastric ca.* (liver 
met)NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOXIMVI 


0.0 


Trachea 


32.3 


Melanoma* (met) SK-MEL-5 


0.0 


Kidney 


0.0 


^diposc 


8.2 



Expression of MOL7 in panel 1.3D was highest in testis, followed by trachea. Expression 
in the biain is at much lower levels. This molecule may therefore have a role in male fertility. 
There was low to undetectable expression in the samples, of panel.4D (CT values >35). 



MOL8b 

Expression of gene MOL8b was assessed using the primer-probe sets Ag3183, described 
10 in Table 27. Results of the RTQ-PCR runs are shown in Table 28. 



Table 27. Probe name: Ag3183 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 
NO: 


Forward 


5'-AAGGGGACGAGTGTGGGATT-3* 


62 


20 


211 


94 


Probe 


TET-5'-TGGCACCGAAGTAGCCGTGGCG- 
3-TAMRA 


74 


22 


301 


95 


Reverse 


5-GCGGGCACTTGGTGTCGCA-3' 


64 


19 


325 


96 
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Table 28. Panel 4D 



Tissue Name 


Rel. Expr., % 
4dx4tm499St 
ag3183_a2 


Tissue N ame 


Rel. Expr.,% 
4dx4rm4998t a 
g3183_a2 


93768 Secondary Till anti- 
CD28/anti-CD3 


0.0 


931O0_HUVEC 
(Endothelial) JL- lb 


5.7 


93769 Secondary Th2 anti- 
CD28/anti-CD3 


0.0 


93779_HUVEC 

(Endothelial)_IFN 

gamma 


18.4 


93770 Secondary Trl anti- 
CD28/anti-CD3 


0.0 


93102_HUVEC 
(Endothelial)JTNF 
alpha 4- EFN gamma 


9.6 


93573JSecondary Thl_resting 
day 4-6 in IL-2 


0.0 


93101_HUVEC 
(Endothelial)_TNF 
alpha + IL4 


11.2 


93572_Secondary Th2_resting 
day 4-6 in IL-2 


0.0 


937S1_HUVEC 
(Endothelial) JL-11 


19.3 


93571_Secondary Trl_resting 
day 4-6 in IL-2 


2.5 


p3583_Lung 
Microvascular 
Endothelial Cells none 


21.6 


93568_primary Thl anti- 
CD28/anti-CD3 


0.0 


93584_Lung 

Microvascular 

Endothelial 

Cells TNFa (4 ng/ml) 

and ILlb (1 ng/ml) 


32.1 


93569 jDrimary Th2 anti- 
CD28/anti-CD3 


0.0 


92662_Microvascular 
Dermal 

endothelium none 


10.7 


93570_primary Trl anti- 
CD28/anti-CD3 


0.0 


92663_Microsvasular 
Dermal 

endothelium. TNFa (4 
ng/ml) and ILlb(l 
ng/ml) 


5.0 


93565_primary Thl_resting dy 
4-6 in IL-2 


0.0 


93773_Bronchial 
epithelium TNFa (4 
ng/ral) and ILlb (1 
ng/ml) ** 


11.6 


93566_primary Th2_resting dy 
4-6 in IL-2 


2.3 


93347_Small Airway 
Epithelium none 


5.6 


93567 ^primary Trl resting dy 
4-6 in IL-2 


0.7 


93348_Small Airway 
EpitheliumJTNFa (4 
ng/ml) and ILlb (1 
ng/ml) 


4.8 


93351_CD45RACD4 
lymphocyte anti-CD28/anti- 
CD3 


29.9 


92668_Coronery 
Artery SMCjesting 


46.8 . 
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93352_CD45RO CD4 
lymphocyte anti-CD2S/anti- 
CD3 


0.0 


92669_Coronery 
Artery SMC TNFa (4 
ng/ml) and ILlb (1 
ng/nil) 


45.1 


V3251_CD8 

Lymphocytes anti-CD28/anti- 
CD3 


0.0 


93 1 07_astrocytes_resti 
ng 


82.4 


y3353_chronic CDS 
Lymphocytes 2ry resting dy 4- 
6 in IL-2 


0.0 


93 108_astrocytes_TNF 
a (4 ng/ml) and ILlb (1 
ng/ml) 


74.9 


93574_chronic CD8 
Lymphocytes 2ry_activated 


0.0 


92666_Kl>812 
(Basophil)_rcsting 


0.0 


93354_CD4_none 


0.4 


92667_KU-812 

(Basophil)__PMA/ionoy 

cin 


0.0 


93252_Secondaiy 
Thl/Th2/Trl anti-CD95 CHI 1 


2.1 


93579_CCD1106 
(Keratinocytes)_none 


0.0 


93103 lAKeellq re^tina 


0 0 


(Keratinocytes)_TNFa 
and TFNe ** 


0 0 

v.V/ 


93788_LAK cells_IL-2 


0.0 


93791_Liver Cirrhosis 


5.6 


93787_LAK cellsJ0L-2+IL-12 


0.0 


93792_Lupus Kidney 


9.3 


93789_LAK cells JL-2+EFN 
gamma 


0.0 


93577_NCI-H292 


2.5 


93790_LAK cells_IL-2+ IL-18 


0.0 


93358_NCI-H292JL-4 


b.o 


93104JLAK 

cells PMA/ionomycin and IL- 
18 


0.0 


93360_NCI-H292_IL-9 


0.0 


j / o_JNifw ceils UL-z_resting 


A A 

U.U 


13 


3.0 


yj iuy_iviixea l^ympnocyte 
ReactionTwo Way MLR 


A A • 
U.U 


AOQC7 XT/™ 1 ! TJTA1* 1T7XT 

yJJ j /_Nd-Tizy2_ lr XM 
gamma 


A A 
U.U 


y 3 1 1 iMMixea Lympnocyte 
Reaction_Two Way MLR 


A A 
U.U 


A "3*7 "7*7 UD A 'E7™' 

y377 /_rir AliC_- 


1 1 A 

31,9 


93 1 1 l_Mixed Lymphocyte 
Reaction_Two Way MLR 


0.0 


93778_HPAEC_IL-1 
beta/TNA alpha 


22.4 


93 1 12_Mononuclear Cells 
(PBMCs) resting 


0.0 


93254_Normal Human 
Lung Fibroblast none 


44.4 


93113 Mononuclear Cells 
(PBMCs)JPWM 


1.8 


93253_NormaI Human 
Lung FibroblastJTNFa 

(A nn/m1\ and TT -t h H 

ng/ml) 


~ 28.3 


93114 Mononuclear Cells 
(PBMCs) PHA-L 


0.0 


93257_Normal Human 
Lung Fibroblast_IL~4 


62.6 


93249JRamos (B cell)_none 


0.0 • 


93256_Normal Human 
Lung Fibroblast_IL-9 


100.0 
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93250_Ranios (B . 
cell) ionomycin 


0.0 


93255_Nonnal Human 
Lung Fibroblast_IL-l 3 


73.3 


93349JB lymphocytes_PWM 


0.0 


93258_Normal Human 
Lung FibroblasHFN 
gamma 


57.1 


93350 B lymphoytes CD40L 
and IL-4 


0.0 


93106_Dermal 

Fibroblasts 

CCD1070_restmg 


91.5 


92665_EOL-l 
(Eosinophil) dbcAMP 
differentiated 


0.0 


93361_Dennal 
Fibroblasts 

CCD1070_TNFalpha4 
ng/ml 


31.1 


93248 JEOL-1 

(Eosinophil)_dbcAMP/PMAio 
nomycin 


1.2 


93105JDennal 

Fibroblasts 

CCD 1 070.IL- 1 beta 1 

ng/ml 


69.7 


93356_Dendritic Cellsnone 


1.8 


93772_dermal 
riorooiasi_iriN gamma 


18.3 


93355_Dendritic Cells_LPS 
100 ng/ml 


0.0 


93771_dermal 
IibroDlast lb-** 


22.4 


93775 Dendritic Cells anti- 
CD40 


0.4 


93259_ffiD Colitis 1** 


3.2 


937 74_Monocytes_resting 


2.8 


93260_fflD Colitis 2 


0.0 


93776_Monocytes_LPS 50 
ng/ml 


0.0 


93261_IBDCrohns 


0.8 


93581_Macrophages_resting 


0.0 


735010_Colon_noimal 


8.9 


93582_Macrophages_LPS 100 
ng/ml 


0.0 


735019^Lung_none 


18.5 


93098JHUVEC 
(Endothelial)_none 


8.4 


64028-l_Thymus_none 


9.4 


93099^HUVEC 
(Endothelial)_starved 


29.9 


64030-l_Kidney_none 


3.0 



Expression of gene CG50889J)2 in two runs of panel LSD was not reproducible and is not considered 
further. 

Expression of gene CG50889J)2 in Panel 4D: There is 30-fold increase in the expression of 
5 CG50889JD2 in activated naive T cells (CD4+ CD45RA cells) as compared to resting CD4 cells. This 
protein is expressed both in resting and activated fibroblasts, endothelium, and epithelium. 
Potential Role(s) of CG50889-02 in Inflammation 

MOL8b may be important in the initial activation of naive T cells. Activated T cells 
initiate the inflammatory process by secreting cytokines and chcmokines, which in turn induce B 
10 cell antibody production leading to the extravasation of leukocytes into inflammatory sites. 
Impact of Therapeutic Targeting of MOL8b: 

Antibody or small molecule therapeutics to MOL8b may block T cell activation in 

response to tissue transplant and reduce or block rejection. These therapeutic drugs may also 

reduce or prevent inflammation in asthma/allergy, psoriasis, arthritis and diabetes in which 
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activated T cells play a pivotal role. Antibodies to MOLSb may also serve as a diagnostic or 
experimental tool to identify and differentiate naive activated T cells from more differentiated T 
cell population (memory T cells). 

MOL9a 

5 Expression of gene MOL9a was assessed using the primer-probe set Ag673, described in 

Table 29. Results of the RTQ-PCR runs are shown in Tables 30 and 3 1 . 

Table 29. Probe name: Ag673 



Primers 


Sequences 


Tm 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5'-TAGAGTTGGTGGTTCCAGGATT-3' 


59.9 


22 


6 


97 


Probe 


FAM-5'- 

TGATGTCTCCTCTTCAGGCAATGTCT- 
3'-TAMRA 


66.6 


26 


56 


98 


Reverse 


5'-TCTGCCAGCCACAGTATAGG-3' 


58.9 


20 


83 


99 



10 



Table 30. Panels 1.3D and 2D 



PANEL 1. 3D 




PANEL 2D 




Tissue Name 


Rel. Expr., % 
1.3dx4tm5796f 
_ag673_bl 


Tissue Name 


Rel. Expr., % 
2Dtm2310f ag6 
73 


Liver adenocarcinoma 


23.4 


Normal Colon GENPAK 
061003 


39.2 


Pancreas 


0.9. 


83219 GC Well to Mod Diff 
(OD03866) 


15.0 


Pancreatic ca. 
CAP AN 2 


8.4 


83220 CC NAT (OD03866) 


15.1 


Adrenal gland 


0.2 


83221 CC Gr.2 rectosigmoid 
(OD03868) 


13.1 


Thyroid 


0.9 


83222 CC NAT (OD03868) 


3.9 


Salivary gland 


0.2 


83235 CC Mod Diff 
(ODO3920) 


20.3 


Pituitary gland 


0.4 


83236 CC NAT (ODO3920) 


7.6 


Brain (fetal) 


8.2 


83237 CC Gr.2 ascend colon 
(OD03921) 


36.3 


Brain (whole) 


4.0 


83238 CC NAT (OD03921) 


4.8 


Brain (amygdala) 


1.5 


83241 CC from Partial 
Hepatectomy (ODO4309) 


30.6 


Brain (cerebellum) 


2.2 


83242 Liver NAT (ODO4309) 


20.3 
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Brain (hippocampus) 


2.9 


B7472 Colon mets to lung 
r ODfl445 1-011 


41.5 


Brain (substantia nigra) 


1.6 


87473 Lung NAT (OD04451- 


12.1 


Brain (thalamus) 


1.6 


Normal Prostate Clontech A+ 
6546-1 


25.5 


Cerebral Cortex 


5.3 


84140 Prostate Cancer 
COD044101 


22.5 


Spinal cord 


1.7 


84141 Prostate NAT 
COD044101 


23.2 


CNS ca. (glio/astro) 

XJO f 1V1VJ 


6.5 


87073 Prostate Cancer 
fOD04720-011 


10.7 


CNSca. (glio/astro) 

TT 1 1 R-MH 
U 1 1 o -IVlVj 


S.5 


87074 Prostate NAT 
fOPXWfKtel 


24.5 


CNS ca. (astro) 


21.6 


Normal Lung GENPAK 
061010 


27.0 


CNS ca.* (neuro; met ) 


12.6 


83239 Lung Met to Muscle 
<T)D042 861 


63.3 


CNS ca. (astro) 


3.9 


83240 Muscle NAT 
fOD042861 


50.7 


CNS ca. (astro) 


18.7 


84136 Lung Malignant Cancer 


42.3 


CNS ca. (glio) 

QKTR 10 

OlNJj- 17 


4.3 


84137 Lung NAT (OD03126) 


29.1 


CNS ca. (glio) 
U251 


17.7 


84871 Lung Cancer 
(OD04404) 


69.3 


CNS ca. (glio) 
SF-295 


12.6 


84872 Lung NAT (OD04404) 


• 26.8 


Heart (fetal) 


0.2 


84875 Lung Cancer 
(OD04565), 


19.1 


Heart 


0.6 


84876 Lung NAT (OD04565) 


12.2 


Fetal Skeletal 


0.8 


85950 Lung Cancer 
(OD04237-01) 


94.0 


Skeletal muscle 


6.5 


85970 Lung NAT (OD04237- 
02) 


20.3 


Bone marrow 


0.7 


83255 Ocular Mel Met to 
Liver (ODO4310) 


74.2 


Thvmus 

J. 1 11 111 Uu 


0.4 


83256 Liver NAT (ODO4310) 


40.1 


Spleen 


0.3 


84139 Melanoma Mets to 
Lung (OD04321) 


36.3 


Lymph node 


0.8 


84138 Lung NAT (OD04321) 


29.7 


Colorectal 


1.2 


Normal Kidney GENPAK 
061008 


62.4 


Stomach 


0.2 


83786 Kidney Ca, Nuclear 
grade 2 (OD04338) 


66.0 


Small intestine 


0.4 


83787 Kidney NAT 
(OD04338) 


23.3 
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Colon ca. 
SW4S0 


10.2 


83788 Kidney Ca Nuclear 
grade 1/2COD04339) 


34.9 


Colon ca.* (SW480 
met)SW620 


59.0 


83789 Kidney NAT 
(OD04339) 


31.0 


Colon ca. 
HT29 


8.4 


83790 Kidney Ca, Clear cell 
type (OD04340) 


45.7 


Colon ca. 
HCT-116 


14.7 


83791 Kidney NAT 
(OD04340) 


33.0 


Colon ca. 
CaCo-2 


18.6 


83792 Kidney Ca, Nuclear 
grade 3 (OD04348) 


18.3 


83219 CC Well to Mod 
Diff(OD03866) 


2.2 


83793 Kidney NAT 
(OD04348) 


42.9 


Colon ca. 
HCC-2998 


16.6 


87474 Kidney Cancer 
(OD04622-01) 


14.5 


Gastric ca.* (liver met) 
NCI-N87 


12.3 


87475 Kidney NAT 
(OD04622-03) 


3.9 


Bladder 


5.9 


85973 Kidney Cancer 
(OD04450-01) 


70.2 


Trachea 


0.3 


85974 Kidney NAT 
(OD04450-03) 


31.6 


Kidney 


1.8 


Kidney Cancer Clontech 
8120607 


11.4 


Kidney (fetal) 


7.8 


Kidney NAT Clontech 
8120608 


7.1 


Renal ca. 
786-0 


8.5 


Kidney Cancer Clontech 
8120613 


8.2 


Renal ca. 
A498 


7.0 


Kidney NAT Clontech 
8120614 


8.1 


Renal ca. 
RXF393 


17.6 


Kidney Cancer Clontech 
9010320 


18.3 


Renal ca. 
ACHN : 


3.9 


Kidney NAT Clontech 
9010321 . 


33.4 


Renal ca. 
UO-31 


12.8 


Normal Uterus GENPAK 
061018 


11.8 


Renal ca. 
TK-10 


23.4 


Uterus Cancer GENPAK 
064011 


30.1 . 


Liver 


1.4 


Normal Thyroid Clontech A+ 
6570-1 


17.3 


Liver (fetal) 


1.0 


Thyroid Cancer GENPAK 
064010 


39.2 


Liver ca. (hepatoblast) 
HepG2 


9.3 


Thyroid Cancer 
[NVrTROGEN A302152 


18.7 


Lung 


77.4 


Thyroid NAT INVITROGEN 
A302153 


18.7 


Lung (fetal) 


1.7 


Normal Breast GENPAK 
361019 


32.3 


Lung ca. (small cell) 
LX-1 


57.5 


B4877 Breast Cancer 
[OD04566) 


40.6 
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Lung ca. (small cell) 
NCI-H69 


11.7 


85975 Breast Cancer 
(OD04590-01) 


100.0 


Lung ca. (s.cell var.) 
SHP-77 


100.0 


S5976 Breast Cancer Mets 
(OD04590-03) 


76.3 


Lung ca. (large 
cell)NCI-H460 


6.3 


87070 Breast Cancer 
Metastasis (OD04655-05) 


73.2 


Lung ca. (non-sm. cell) 
A549 


28.7 


GENPAK Breast Cancer 
064006 


46.0 


Lung ca. (non-s.cell) 
NCI-H23 


4.8 


Breast Cancer Res. Gen. 1024 


31.2 


Lung ca (non-s.cell) 
HOP-62 


3.1 


Breast Cancer Clontech 
9100266 


41.5 


Lung ca. (non-s.cl) 
NCI-H522 


31.7 


Breast NAT Clontech 
9100265 


13.4 


Lung ca. (squam.) 
SW 900 


19.4 


Breast Cancer INVITROGEN 
A209073 


39.2 


Lung ca. (squam.) 
NTCI-H596 


27.4 


Breast NAT INVITROGEN 
A2090734 


39.8 


Mammary gland 


3.7 


Normal Liver GENPAK 
061009 


20.9 


Breast ca.* (pi. 
effusion) MCF-7 


10.6 


Liver Cancer GENPAK 
064003 


15.6 


Breast ca.* (pl.ef) 
MDA-MB-231 


6.6 


Liver Cancer Research 
Genetics RNA 1025 


17.2 


Breast ca.* (pi. 
effusion) T47D 


17.8 


Liver Cancer Research 
Genetics RNA 1026 


9.6 


Breast ca. 
BT-549 


15.0 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6004-T 


42.9 


Breast ca. 
MDA-N 


5.6 


Paired Liver Tissue Research 
Genetics RNA 6004-N 


18.6 


Ovary 


1.4 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6005-T 


12.2 


Ovarian ca. 
OVCAR-3 


8.4 


Paired Liver Tissue Research 
Genetics RNA 6005-N 


8.1 


Ovarian ca. 
OVCAR-4 


2.6 


Normal Bladder GENPAK 
061001 


33.7 


Ovarian ca. 
OVCAR-5 


44.9 


Bladder Cancer Research 
Genetics RNA 1023 


5.7 


Ovarian ca. 


5.0 


Bladder Cancer 
TNVTTROGFN A302173 


23.5 


Ovarian ca. 
IGROV-1 


5.2 


87071 Bladder Cancer 
(OD04718-01) 


69.7 


Ovarian ca.* (ascites) 
SK-OV-3 


50.1 


87072 Bladder Normal 
Adjacent (OD04718-03) 


20.6 


Uterus 


0.7 


Normal Ovary Res. Gen. 


2.9 


Plancenta 


0.0 


Ovarian Cancer GENPAK 
064008 


18.6 
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Prostate 


0.4 


87492 Ovary Cancer 
(OD04768-07) 


53.2 


Prostate ca.* (bone 
met)PC-3 


1.9 


87493 Ovary NAT 
(OD04768-08) 


13.5 


Testis 


0.3 


Normal Stomach GENPAK 
061017 


15.0 


Melanoma 
Hs688(A).T 


2.1 


Gastric Cancer Clontech 
9060358 


2.5 


Melanoma* (met) 
Hs688(B).T 


2.3 


NAT Stomach Clontech 
9060359 


8.3 


Melanoma 
UACC-62 


5.7 


Gastric Cancer Clontech 
9060395 


19.1 


Melanoma 
M14 


5.6 


NAT Stomach Clontech 
9060394 


4.6 


Melanoma 
LOX 1MV1 


12.5 


Gastric Cancer Clontech 
9060397 


. 29.5 


Melanoma* (met) SK- 
MEL-5 


11.3 


NAT Stomach Clontech 
9060396 


3.7 


Adipose 


3.2 


Gastric Cancer GENPAK 
064005 


9.9 



Table 31. Panels 3D and 4D 



PANEL 3D 




PANEL 4D 




Tissue Name 


Rel. Expr., % 
3dx4tm5137f 
_ag673_bl 


Tissue Name 


Rel. Expr., 
% 

4dtm4833f 
ag673 


94905_Daoy__Medulloblasto 
ma/Cerebellum sscDNA 


13.6 


93768 Secondary Thl anti- 
CD2S/anri-CD3 


11.9 


94906_TE671JVledulloblast 
om/Cerebellum sscDNA . 


11.6 


93769_Secondary Th2_anti- 
CD28/anti-CD3 


8.2 


94907JD283 

Med_Medulloblastoma/Cere 
bellum sscDNA 


53.3 


93770_Secondary Trl_anti- 
CD28/anti-CD3 


6.6 


94908_PFSK-l_Primitive 
NeuToectodennal/Cerebellum 
sscDNA 


9.3 


93573_Secondary Thl_resting 
day 4-6 in IL-2 


0.0 


94909 XF- 

498 CNS sscDNA 


7.8 


93572_Secondary Th2_resting 
day 4-6 in IL-2 


0.8 


94910_SNB- 

78 CNS/glioma sscDNA 


8.6 


93571_Secondary Trl_resting 
day 4-6 in IL-2 


0.5 


9491 1_SF- 

268 CNS/glioblastoma sscD 
NA 


14.1 


93568_primary Thl_anti- 
CD28/anti-CD3 


17.7 


94912_T98G_Glioblastoma_ 
sscDNA 


15.3 


93569_primary Th2 anti- 
CD28/anti-CD3 


9.1 


96776_SK-N- 

SH Neuroblastoma 


15.1 


93570 primary Trl_anti- 
CD28/anti-CD3 


16.3 
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(metastasis)_sscDNA 








295_CNS/glioblastoma_sscD 
NA 


13.6 5 

i 


)3565_primaiy Thl_resting dy 
1-6 in IL-2 


7.9 


94914 Cerebellum sscDNA 


5.5 5 


)3566_primary Th2_resting dy 
t-6 in IL-2 


2.6 


96777_Cerebellum_sscDNA 


0.9 ! 


)3567_primary Trl_resting dy 
t-6 in IL-2 


4.3 


94916_NCI- 

H292_Mucoepidermoid lung 
carcinoma sscDNA 


36.5 ! 
( 


)3351_CD45RA CD4 
lymphocyte anti-CD28/anti- 
CD3 


13.4 


04017 HMQ-1 Id ^TTinll ppII 
y*fyi / jL/ivio~i ih_ohi<iii ecu 

lung cancer_sscDNA 


O. 1 


93352 CD45RO CD4 
lymphocyte anti-CD28/anti- 
CD3 


8.9 


94918_PMS-79_Small cell 
lung 

cancer/neuroendocrine_sscD 
NA 


100.0 


?3251_CD8 

Lymphocytes anti-CD28/anti- 
CD3 


12.9 


94919_NCI-H146_Small cell 
lung 

cancer/neuroendocrine_sscD 
NA 


32.1 


93353_chronic CDS 

f vrrmhncvtes 2rv res tin 2 dv 4- 

6 in IL-2 


10.5 


94920_NCI-H526__Small cell 
lung 

cancer/neuroendocrine_sscD 
NA 


30.0 


93574_chronic CDS 

T vrmYhnrvfps 2rv activated 

CD3/CD28 


5.2 


94921 J^CI-N4 17 JSrnall cell 
lung 

cancer/neuroendocrine_sscD 
NA 


. 3C6 


93354_CD4_none 


1.4 


XTPT USD Qmnll 
y4y/^ INL/i XlOZ oITlUll Cell - 

lung 

cancer/neuroendocrine_ssct) 
NA 




9"V252 Secondary 
Thl/Th2/Trl_anti-CD95 CHH 


0.8 


H157_Squamous cell lung 
cancer (metastasis)_sscDNA 


fn \ 
\i / * i 


9^103 LAIC cells resting 


18.0 


94925_NCI-H1 155_Large 
cell lung 

cancer/neuroendocrine_sscD 

MA 


19.8 


93788_LAKcells_IL-2 


7.8 


94926_NCI-H1299_Large 
cell lung 

cancer/neuroendocrine__sscD 
NA 


36.3 


93787_LAK ceUs_DL-2+IL-12 


8.1 


94927_NCI-H727JLung 
carcinoid sscDNA 


20.2 


93789_LAK cells_IL-2+IFN 
gamma 


13.6 


9492S_NCI-UMC-1 lJLung 
carcinoid sscDNA 


73.3 


• 93790_LAK cells_IL-2+ IL-18 


10.9 
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94929_LX-l_Small cell lung 
cancer_sscDNA 


20.2 


93104_LAK 

cells PMA/ionomycin and 1L- 
18 


4.8 


94930_Colo-205_Co1on 
cancer sscDNA 


11.6 


93578_NK Cells IL-2_resting 


1.7 


94931_KM12_Colon 
cancer sscDNA 


31.6 


93109_Mixed Lymphocyte 
Reaction Two Way MLR 


13.0 


94932_KM20L2_Colon 
cancer sscDNA 


8.5 


931 10_Mixed Lymphocyte 
Reaction_Two Way MLR 


11.4 


94933_NCI-H7 1 6_Colon 
cancer sscDNA 


36.9 


93 1 1 l_Mixed Lymphocyte 
Reaction_Two Way MLR 


3.2 


94935_SW-48_Colon 
adenocarcinoma sscDNA 


12.5 


93 1 12_Mononuclear Cells 
(PBMCs)_resting 


1.4 


94936__SW1116_Colon 
adenocarcinoma sscDNA 


9.0 


93113 Mononuclear Cells 
(PBMCs)_PWM 


23.7 


9493 7_LS 174T_CoIon 
adenocarcinoma sscDNA 


32.6 


93114 Mononuclear Cells 
(PBMCs)_PHA-L 


8.7 


94938_SW-948_Colon 
adenocarcinoma sscDNA 


1.5 


93249_Ramos (B cel1)_none 


19.9 


94939_SW-480_Colon 
adenocarcinoma sscDNA 


6.7 


93250_Ramos (B 
cell)_ionomycin 


100.0 


94940_NCl-SNU-5_Gastric 
carcinoma sscDNA 


12.4 


93349_B lymphocytes_PWM 


65.1 


94941_KATO HI_Gastric 
carcinoma sscDNA 


55.9 


93350_B lymphoytes_CD40L 
and IL-4 


6.0 


94943_NCI-SNU-16_Gastric 
carcinoma_sscDNA 


12.2 


92665_EOL-l 

(Eosinophil)_dbcAMP 

differentiated 


12.5 "•' 


94944_NCI-SNU-l_Gastric 
carcinoma_sscDNA 


. 58.1 


93248JEOL-1 

(Eosinophil)_dbcAMP/PMAio 
nomycin 


4.4 


94946_RF-l_Gastric 
adenocarcinoma sscDNA 


11.4 


93356JDendritic Cells_none 


9.5 • 


94947_RF-48J3astric 
adenocarcinoma sscDNA 


15.3 


93355_Deridritic Cells_LPS 
100 iig/ml 


8.7 


96778_MKN-45_Gastric 
carcinoma sscDNA 


33.2 


93775 Dendritic Cells anti- 
CD40 


12.2 


94949_NCI-N87_Gastric 
carcinoma sscDNA 


16.7 


93774_Monocytes_jesting 


14.8 


9495 l_OVCAR-5_Ovarian 
carcinoma sscDNA 


10.0 


93776 Jtfonocytes_LPS 50 
ng/ml 


15.5 


94952_RL95-2_Uterine 
carcinoma sscDNA 


7.6 


93 5 8 1 JMacrophages_jesting 


35.4 


94953 HelaS3 Cervical 
adenocarcinoma sscDNA 


11 1 

11.1 


ng/ml 


4 4 


94954_Ca Ski_Cervical 
epidermoid carcinoma 
(metastasis) sscDNA 


28.9 


93098_HUVEC 
(Endothelial)_none 


27.0 . 


94955JES-2JDvarian clear 

: „ ~~~TVXT A 


15.5 


93099_HUVEC 


39.2 
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cell carcinoma_sscDNA 




;Endothelial)_starved 




y~Ts—*l XvalJllo/Ull olllll y 

Stimulated with 
PMA/ionomycin 6h sscDNA 


15.8 


93100 HUVEC 
(Endo5ielial)_IL-lb 


12.7 


94958 Ramos/1 4h stim 
Stimulated with 
EMA/ionomycin 
14h sscDNA 


12.4 


93779 HUVEC 
^Endothelial)_EFN gamma 


17.1 


949 62_MEG-0 1 _Chronic 
myelogenous leukemia 
(megokaryoblast)_sscDNA 


25.5 


93102_HUVEC 
(Endothelial) JTNF alpha + 
EFN gamma 


14.0 


94963 Raji Burkitt's 
lymphomasscDNA 


8.4 


93101_HUVEC 
(Endothelial) TNT alpha + IL4 


22.4 


94964_Daudi_Burkitt's 
lymphoma sscDNA 


29.1 


93781_HUVEC 
(Endothelial) IL-11 


11.7 


94965_U266_B-cell 
plasmacytoma/myeloma ssc 
DNA 


8.0 


93583_Lung Microvascular 
Endothelial Cells_none 


14.7 


94968_CA46_Burkitt's 
lymphoma sscDNA 


7.2 


93584_Lung Microvascular 
Endothelial CcllsJTNFa (4 
ng/ml) and ILlb (1 ng/ml) 


19.2 


94970_RL_non-Hodgkin's B- 
cell lymphoma sscDNA 


10.7 


92662_Microvascular Dermal 
endothelium none 


26.8 


94972_JMl_pre-B-cell 
lymphom a/leukemia sscDN 
A 


5.5 


92663_Microsvasular Dermal 
endothelium TNFa(4ng/ml) 
and ILlb (1 ng/ml) 


18.2 


94973_Jurkat_Tcell 
leukemia sscDNA 


15,5 


93773_Bronchial 
epithelium TNFa(4 ng/ml) 
and ILlb (1 ng/ml) ** 


2.4 


94974 IT- 

1 ErythroleukemiasscDNA 


33.9 


93347_Small Airway 
Epithelium_none 


8.7 


94975_HUT78__T-cell 
lymphoma sscDNA 


23.7 


93348_Small Airway 
Epimehum_TNFa (4 ng/ml) 
and ILlb (1 ng/ml) 


. 37.9 


94977 U937 Histiocytic 
lymphomasscDNA 


29.4 


92668_Coronery Artery 
SMC resting 


17.4 


94980 KU- 
812_Myelogenous 
leukemia sscDNA 


27.2 


92669 Coronery Artery 
SMC_TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


7.4 


94981_769-P_Clear cell 
renal carcinoma sscDNA 


17.3 


93 1 07_astrocytesjresting 


13.1 


949S3_Caki-2_CIear cell 
renal carcinoma sscDNA 


19.9 


93108_asfrocytes_TNFa (4 
ng/ml) and ILlb (1 ng/ml) 


9.3 


94984_SW 839_Clear cell 
renal carcinoma sscDNA 


15.4 


92666JOJ-812 
(Basophil) resting 


13.0 


94986_G401_Wilms' 
tumor sscDNA 


19.5 


92667_KU-812 
(Basophil)_PMA/ionoycin 


26.8 


94987_Hs766T_Pancreatic 
carcinoma (LN 


12.7 


93579_CCD1106 
(Keratinocytes)none 


14.9 



195 



BNSOOCID: <WO_OIB1578A2J_>- 



WO 01/81578 



PCT/US01/13578 



metastasis)_sscDNA 








94988_CAPAN-l_Pancreatic 
adenocarcinoma (liver 
metastasis)_sscDNA 


11.1 


93580_CCD1106 
(Keratinocytes)_TNFa and 
EFNg ** 


1.4 


94989_SU86.86_Pancreatic 
carcinoma (liver 
metastasis)_sscDNA 


38.5 


93791 JLiver Cirrhosis 


2.3 


94990_BxPC-3_Pancreatic 
adenocarcinoma sscDNA 


8.2 


93792_Lupus Kidney 


1.0 


9499 1 _HPAC_Pancreatic 
adenocarcinoma sscDNA 


24.3 


93577_NCI-H292 


12.3 


94992_MIA PaCa- 
2_Pancreatic 
carcinoma sscDNA 


5.6 


93358_NCI-H292_IL-4 


20.0 


94993_CFPAC-l_Pancreatic 
ductal 

adenocarcinoma sscDNA 


32.2 


93360_NCI-H292_IL-9 


31.4 


94994_PANC-l_Pancreatic 
epithelioid ductal 
carcinoma sscDNA 


29.0 


93359_NCI-H292JL-13 


13.7 


94996JT24_Bladder 
carcinma (transitional 
cell)_sscDNA 


13.0 


93357_NCI-H292_EFN gamma 


15.2 


94997_5637_Bladder 
carcinoma sscDNA 


18.4 


93777_HPAEC_- 


14.7 


94998_HT-1197_Bladder 
carcinoma sscDNA 


25.1 


93778_HPAECJL-1 
beta/TNA alpha 


12.7 


94999_UM-UC-3_Bladder 
carcinma (transitional 
cell)_sscDNA 


3.3 


93254_Normal Human Lung 
Fibroblast_none 


8.0 


95000_A204_Rhabdomyosar 
comajsscDNA 


16.1 


93253_Normal Human Lung 
Fibroblast TNFa (4 ng/ml) and 
IL-lb (1 ng/ml) 


3.6 


95001_HT- 

1080 Fibrosarcoma sscDNA 


16.7 


93257_Normal Human Lung 
Fibroblast IL-4 


18.7 


95002_MG- 

63_Osteosarcoma 

(bone)_sscDNA 


12.9 


93256_Normal Human Lung 
Fibroblast JL-9 


13.5 


95003_SK-LMS- 

l_Leiomyosarcoma 

(vulva)_sscDNA 


34.7 


93255_Normal Human Lung 
Fibroblast_IL-13 


10.2 


95004_SJRH30_Rhabdomyo 
sarcoma (met to bone 
marrow)_sscDNA 


16.9 


93258_Normal Human Lung 
Fibroblast JOFN gamma 


17.2 


95005_A43 l_Epidermoid 
carcinoma sscDNA 


9.4 


93 106_Dermal Fibroblasts 
CCD1070_resting 


39.0 


95007_WM266- 

4 Melanoma sscDNA 


6.9 


93361 JDermal Fibroblasts 
CCD1 070. TNF alpha 4 ng/ml 


47.0 
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95010_DU 145_Prostate 
carcinoma (brain 
metastasis) sscDNA 


0.3 


93105_Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/ml 


15.7 


95012_MDA-MB- 
468_Breast 

adenocarcinoma sscDNA 


13.4 


93772_dermal fibroblastJFN 
gamma 


5.6 


95013_SCC-4_Squamous 
cell carcinoma of 
tone lie sscDNA 


0.1 


93771_dermal fibroblast_IL-4 


11.7 


95014_SCC-9_Squamous 
cell carcinoma of 
toneue sscDNA 


0.2 


93259 JBD Colitis 1** 


0.1 


9501 5_SCC-1 5_Squamous 
cell carcinoma of 
tongue sscDNA 


0.8 


93260 JBD Colitis 2 


0.3 


95017_CAL 27_Squaraous 
cell carcinoma of 
tongue sscDNA 


29.0 


93261__IBDCrohns 


0.5 






73501 0_Colon_normal 


3.1 






735019_Lung u _none 


7.1 






64028-1 JThymus__none 


9.4 






64030-l_Kidney_none 


3.0 



Panel J. 3D description: Hie gene MOL9a is expressed in a number of tissues, including the central 
nervous system, lung, mammary gland and kidney. Moreover, its expression seems to be enhamed in 
tumor cell lines as compared to noimal tissue in most cases with a good merapeutic window, 
5 Panel 2D description: Tissue distribution of this gene MOL9a in panel 2D confirms the results obtained 
in panel' J. 3 D. There is enhanced expression of this gene MOL9a in tumor tissue as against the normal 
adjacent tissue, particularly in lung and kidney cancer, but also in cases of colon cancer, ovarian cancer, 
breast cancer, gastric cancer, bladder cancer, liver cancer and thyroid cancer. Some metastases, 
particularly me lung and those from melanoma express high levels ofMOISa. Corroborative information 
10 about the expression of tins molecule is available in the form ofESTs, mostly from endothelial cells, 

colon, ovarian tumors, pancreas and brain regions. Panel 3D demonstrates increased expression of this 
gene MOL9a in a variety of carcinomas, supporting the results of panels 1.3D and 2D thus demonstrating 
utility for this protein as an antibody target. Tlierefore antibodies specific to this protein may be used as 
a therapeutic in the treatment of various types of cancer, 

15 

Panel 4D Description: This gene MOL9a is upregulated in endothelium, and epithelium 
regardless of stimulus. There is also high level expression of this protein in ionomycin-treated B 
cell line and mitogen (pokeweed mitogen, PWM) treated B cells. Consistent with this finding 
Peripheral blood mononuclear cells (PBMC) treated with PWM also demonstrate increased 
20 expression of this molecule. Further, induction of MOL9a is seen in activated T cells. In PBMC 
the T cell specific mitogen (Phytohemagglutinin, PHA) induces the expression of this transcript 
and in acute and chronically activated T cells the expression of this transcript is increased as 
compared to untreated of resting T cells. 
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The MOL9a is induced in activated B and T lymphocytes and may thus have a potential 
role in inflammation by regulating lymphocyte trafficking, or activation, or increasing tissue 
destruction. This molecule may also serve as a marker for activated T or B cells. 

Impact of Therapeutic Targeting of MOL9a: Small molecule or antibody therapies to the 
5 molecule encoded by MOL9a may inhibit tissue damage due to T or B cell activation and the 
bioactive molecules produced by these cell types. These diseases would include asthma/allergy, 
colitis, Crohn's disease, lupus, and arthritis. Alternatively, protein therapeutics based on this 
molecule could serve as an adjuvant and help boost the effectiveness of vaccines or regulate 
immune status during organ transplant 1 95067 19_BJEXT may also serve as a marker for 
10 activated T cells and serve as a diagnostic tool in determining the extent of inflammation in 
autoimmune diseases such as asthma/allergy, colitis, Crohn's disease, lupus, and arthritis. 



MOL9b 

Expression of gene MOL9b was assessed using the primer-probe set Ag245S, described 
15 in Table 32. Results of the RTQ-PCR runs are shown in Tables 33 and 34. 



Table 32. Probe name: Ag2458 



Primers 


Sequences 


Tm 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5'-GTTGGTGGTTCCAGGATTTT-3' 


58.7 


20 


58 


100 


Probe 


TET-5'- 

TGATGTCTCCTCTTCAGGCAATGTCT- 
3-TAMRA 


66.6 


26 


104 


101 


Reverse 


5'-CTGCCAGCCACAGTATAGGA-3' 


58.9 


20 


130 


102 



20 Table 33. Panels 1.3D and 2D 



PANEL 1.3D 






PANEL 2D 




Tissue Name 


Rel. Expr., 

% 

1.3dtm4270t 
_ag2458 


Rel. Expr., 

% 

1.3dx4tm54 
07t ag2458 
a2 


Tissue Name 


Rel. Expr., 

% 

2dtm4271t 
ag2458 


Liver 

adenocarcinoma 


33.9 


41.4 


Normal Colon GENPAK 
061003 


62.4 


Pancreas 


1-3 


0.8. 


832 19 CC Well to Mod Diff 
(OD03866) 


16.8 


Pancreatic ca. 


6,9 


44.3 


83220 CC NAT (OD03866) 


11.7 
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CAP AN 2 










Adrenal gland 


0.7 


1.1 


S3221 CC Gr.2 rectosigmoid 
(OD03868) 


18.6 


Thvraid 


2.8 


3.3 


83222 CC NAT (OD03868) 


8.4 


Salivarv eland 


0.6 


2.0 


83235 CC Mod Diff 
(ODO3920) 


34.2 


Pituitary yland 


1.6 


0.4 


83236 CC NAT (ODO3920) 


11.3 


Brain (fetal) 


9.2 


19.7 


83237 CC Gr.2 ascend colon 
(OD03921) 


74.7 


Brain (whole) 


5.4 


17.7 


83238 CC NAT (OD03921) 


9.6 


Brain (amygdala) 


6.7 


9.0 


83241 CC from Partial 
Hpnatectomv ff)DO43091 


35.6 


Brain (cerebellum) 
Brain 

fiippocampus ) 


2.7 


10.2 


83242 Liver NAT 


32.3 


23.0 


13.1 


87472 Colon mets to lung 


29.7 


Brain (substantia 
nigra) 


1.6 


7.2 


87473 Lung NAT 
fYYnn445i-0?^ 


6.4 


Brain (thalamus) 


4.9 


16.6 


Normal Prostate Clontcch 

n, t Ujtu j 


8.8 


Cerebral Cortex 


26.2 


8.2 


84140 Prostate Cancer 
(OD04410) 


25.9 


Spinal cord 


2A 




0/11/11 DrAntoto TvT A T 

(OD04410) 


77 Q 


CNS ca. (glio/astro) 
U87-MG 


11.0 


19. / 


o i\) f j rrostatc cancer 
(OD04720-01) 


1 5 ft 
lJ.O 


CNS ca. 

(glio/astro) U-118- 
ivilj ■ 


45.1 


82.4 


o /U /** xTOState JN A 1 

(OD04720-02) 


ZO.l 


CNS ca. (astro) 

owl /oj 


21,9 


. 46.0 


Normal Lung GENPAK 
opioid 

\J\3 1 Ul V 


24.8 


CNS ca * (neuro; 
met j oiv-iN-/vo 


73.7 


40.2 


83239 Lung Met to Muscle 


100.0 


CNS ca. (astro) 


9.9 


12.0 


83240 Muscle NAT 


31.9 


CNS ca. (astro) 


22.5 


71.5 


84136 Lung Malignant 


29.5 


CNS ca. (glio) 


6.2 


18.1 


84137 Lung NAT 

\\JXJ\JO 1 £i\3 ) 


27,9 


CNS ca. (glio) 
U251 


5.9 


45.0 


84871 Lung Cancer 
(OD04404) 


71.7 


CNS ca. (glio) 
SF-295 


13.0 


26.4 


84872 Lung NAT 
(OD04404) 


15.1 


Heart (fetal) 


1.7 


0.0 


84875 Lung Cancer 
(OD04565) 


28.5 


Heart 


1.2 


4.4 


84876 Lung NAT 
(OD04565) 


13.1 
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Fetal Skeletal 


11.4 


1.2 


85950 Lung Cancer 
(OD04237-01) 


90.1 


Skeletal muscle 


3.8 


41.7 


85970 Lung NAT 
(OD04237-02) 


16.8 


Bone marrow 


1.5 


2.2 


83255 Ocular Mel Met to 
Liver (ODO4310) 


76.3 


Thymus 


0.8 


0.4 


83256 Liver NAT 
(ODO4310) 


26.8 


Spleen 


0.9 


0.8 


841 39 Melanoma Mets to 
Lung (OD04321) 


36.3 


Lymph node 


1.3 


10.4 


84138 Lung NAT 
(OD04321) 


22.5 


Colorectal 


4.8 


3.1 


Normal Kidney GENPAK 
061008 


33.7 


Stomach 


0.0 


3.1 


83786 Kidney Ca, Nuclear 
grade 2 (OD0433S) 


68.8 


Small intestine 


1.1 


2.3 


83787 Kidney NAT 
(OD04338) 


33.4 


Colon ca. 
SW480 


19.5 


18.9 


83788 Kidney Ca Nuclear 
grade 1/2 (OD04339) 


30.6 


Colon ca.* (SW480 
met)SW620 


29.9 


29.8 


83789 Kidney NAT 
(OD04339) 


27.2 


Colon ca. 
HT29 


17.7 


9.7 


83790 Kidney Ca, Clear cell 
type (OD04340) 


49.3 


Colon ca. 
HCT-116 


22.1 


43.8 


83791 Kidney NAT 
(OD04340) 


32.1 


Colon ca. 
CaCo-2 


13.5 


18,4 


83792 Kidney Ca, Nuclear 
grade 3 (OD04348) 


16.0 


83219 CC Well to 
Mod Diff 
(OD03866) 


.10.6 


7.0 


83793 Kidney NAT 
(OD04348) 


35.4 


Colon ca. 
HCC-2998 


45.7 


21.1 


87474 Kidney Cancer 
(OD04622-01) 


13.2 


Gastric ca.* (liver 
met)NCI-N87 


38.4 


69.5 


87475 Kidney NAT 
(OD04622-03) 


4.0 


Bladder 


7.4 


13.6 


85973 Kidney Cancer 
(OD04450-01) 


48.6 


Trachea 


2.9 


3.0 


85974 Kidney NAT 
(OD04450-03) 


30.4 


Kidney 


1.3 


3.5 


Kidney Cancer Clontech 
8120607 


20.4 


Kidney (fetal) 


3.6 


3.9 


Kidney NAT Clontech 
8120608 


7.3 


Renal ca. 
786-0 


10.0 


19.0 


Kidney Cancer Clontech 
8120613 


14.4 


Renal ca. 
A498 


29.7 


28.6 


Kidney NAT Clontech 
8120614 


8.8 


Renal ca. 
RXF393 


5.6 


53.0 


Kidney Cancer Clontech 
9010320 


. 12.2 
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Renal ca. 
ACHN 


5.5 


13.3 1 


Kidney NAT Clontech 

2010321 


22.1 


Renal ca. 
UO-31 


21.6 


45.5 1 

1 


Normal Uterus GENPAK 
36101b 


8.3 


Renal ca. 
TK-10 


20.4 


27.0 1 


Uterus Cancer GENPAK 
D640 11 


15.3 


Liver 


2.9 


2.4 


Normal Thyroid Clontech 
A+ 6570-1 


1 C 1 

15.1 


Liver (fetal) 


3.8 


5.0 


Thyroid Cancer GENPAK 
064010 


33.0 


Liver ca. 

(hepatoblast) 

HepG2 


17.3 


28.2 


Thyroid Cancer 
INVITROGENA302152 


21.0 


Lung 


1.9 


2.5 


Thyroid NAT 
INVllKUuJtlJN A3U21-0 


14.4 


Lung (fetal) 


1.4 


5.6 


Normal Breast GENPAK 

A/" lAlfl 

06101V 


33.2 


Lung ca. (small 
cell) LX-1 


11.8 


40.6 


84877 Breast Cancer 

/r\T\f\ AC£ £\ 

(OD04566) 


44.8 


Lung ca. (small 
cell) NCI-H69 


31.4 


44.0 


85975 Breast Cancer 

/r~\T\f\ A CAA Al\ 

(OD04590-01) 


95.9 


Lung ca. (s.cell 
var.) SHP-77 


69.3 


90.5 


85976 Breast Cancer Mets 

//~\T\ri A CAA 

(OD04590-O3) 


61.1 


Lungca. (large 
cell)NCI-H460 


9.9 


63.6 


87070 Breast Cancer 
Metastasis (OD04655-05) 


38.4. 


Lung ca. (non-sm. 
ccll)-A549 


20.9 


25.9 


GENPAK Breast Cancer 
064006 


33.2 


Lung ca. (non- 
s.cell) NCI-H23 


11.3 


10.7 


Breast Cancer Res. Gen. 
1024 


23.0 


Lung ca (non-s.cell) 
HOP-62 


4.1 


9.1 


Breast Cancer Clontech 
yiuozoo 


33.4 


Lung ca. (non-s.cl) 
NCI-II522 


29.1 


30;o 


Breast NAT Clontech 
9100265 


19.5 


Lung ca. (squam.) 
SW900 


9.7 


20.1 


Breast Cancer 

INVl lKOGEN A209O73 


47.0 


Lung ca. (squam.) 
NCI-H596 


9.5 


43.2 


Breast NAT INVITROGEN 
A2090734 


37.6 


Mammary gland 


5.3 


9.0 


Normal Liver GENPAK 
061009 


15.5 


Breast ca * (pi. 
effusion) MCF-7 


14.3 


30.8 


Liver Cancer GENPAK 
064003 


14.0 


Breast ca * (pl.ef) 
MDA-MB-231 


42.3 


41.1 


Liver Cancer Research 
Genetics RNA 1025 


OA A 

20.4 


Breast ca * (pi. 
effusion) T47D 


9.6 


13.8 


Liver Cancer Research 
Genetics RNA 1026 


7.1 


Breast ca. 
BT-549 


100.0 


100.0 


Paired Liver Cancer Tissue 
. Research Genetics RNA 
6004-T 


24.8 


Breast ca. 

\ XT\ A XT 


17.2 


8.8 


Paired Liver Tissue Research 


18.0 
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MDA-N 






Genetics RNA 6004-N 




Ovary 


4.8 


0.9 


Paired Liver Cancer Tissue 
Research Genetics RNA 
6005-T 


13.4 


Ovarian ca. 
OVCAR-3 


11.3 


20.9 


Paired Liver Tissue Research 
Genetics RNA 6005-N 


7.3 


Ovarian ca: 
OVCAR-4 


2.5 


10.6 


Normal Bladder GENPAK 
061001 


59.9 


Ovarian ca. 
OVCAR-5 


20.0 


26.3 


Bladder Cancer Research 
Genetics RNA 1023 


5.2 


Ovarian ca. 
OVCAR-8 


18.6 


16.5 


Bladder Cancer 
INVITROGEN A3021 73 


30.1 


Ovarian ca. 
IGROV-1 


6.9 


6.8 


87071 Bladder Cancer 
(OD047 18-01) 


65.5 


Ovarian ca.* 
(ascites) SK-OV-3 


32.3 


64.3 


87072 Bladder Normal 
Adjacent (OD047 18-03) 


18.4 


Uterus 


1.8 


4.3 


Normal Ovary Res. Gen. 


5.1 


Plancenta 


2.2 


1.1 


Ovarian Cancer GENPAK 
064008 


29.1 


Prostate 


2.0 


2.4 


87492 Ovary Cancer 
(OD04768-07) 


66.0 


Prostate ca.* (bone 
met)PC-3 


3.1 


3.2 


87493 Ovary NAT 
(OD04768-08) 


7.7 


Testis 


2.0 


1.5 


Normal Stomach GENPAK 
061017 


16 : 4 


Melanoma 
Hs688(A).T 


4.6 


5.4 ■ 


Gastric Cancer Clontech 
9060358 


3.5 


Melanoma* (met) 
Hs688(B).T 


2.4 


6.9 


NAT Stomach Clontech 
9060359 


10.5 


Melanoma 
UACC-62 


1.3 


12.6 


Gastric Cancer Clontech 
9060395 


29.9 


Melanoma 
M14 


5.9 


56;2 


NAT Stomach Clontech 
9060394 ' 


12.9 


Melanoma 
LOXIMVI 


14.7 


8.1 


Gastric Cancer Clontech 
9060397 


42.3 


Melanoma* (met) 
SK-MEL-5 


16.2 


24.7 


NAT Stomach Clontech 
9060396 


6.2 


Adipose 


4.4 


3.4 


Gastric Cancer GENPAK 
064005 


31.0 



Table 34. Panels 3D and 4D 



PANEL 3D 




PANEL 4D 




Tissue Name 


Rel. Expr., 
% 

3dx4tm5121 
t ag2458 b 


Tissue Name 


Rel. Expr., 
% 

4dtm4272t 
ag2458 
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2 






9490 S Danv Medullnh1a<;1 

oma/Cerebellum sscDNA 


13 6 


0^7fiR Secnndarv Thl anti- 
CD28/anti-CD3 


14.5 


94906_TE67 l_Medullobla 

q torn /Pprphpl In m ^pTVNJ A 


7.4 


93769_Secondary Th2_anti- 


7.4 


94907_D2S3 

JV/fpd Ivtpdiill ohlaQtnm A/Ppr 

ebellum sscDNA 


53.0 


93770_Secondary Trl_anti- 
PD?8/anti-CD3 


9.5 


94908 PFSK-1 Primitive 

7*T7UO 1 1 O IV 1 XllilllLlV^ 

Neuroectodermal/Cerebellu 
m sscDNA 


\J.J 


93S73 SprnndarvThl restine dav 
4-6 in IL-2 


0.2 


94909 XF- 

498 CNS sscDNA 


6 8 


93S72 Secondary Th2 restinedav 
4-6 in IL-2 


0.6 


94910 SNB- 

78_CNS/glioma_sscDNA 


9.8 


Q3S71 Secondary Trl resting dav 
4-6 in IL-2 


0.8 


94911__SF- 

96R PNS/alinhlastnrna cqp 

DNA 


10.2 


93568_primary Thl_anti- 
PD?R/anti-PD3 


19.8 


94912_T98G_Glioblastom 

a ccrTYWA 


15.7 


93569_primary Th2jmti- 
rn>8/anti-PD3 ~ 


13.0 


96776_SK-N- 

^piirnhlsictoTim 
oil li^ui vuiaoiunia 

(mctastasis)_sscDNA 


16.5 


93570_primary Trl_anti- 
PD28/anti-CD3 

v^J_/i^O/ anil v^x/j 


19.2 


94913 ^F- 

295 CNS/glioblastoma ssc 
DNA 


7 4 


Q3^6S "nriTnarv Xhl rpQtin<y dv 4-fi 

S O *^U»' UJLlllltliy X 111 IVDHLlti UJr i VJ 

in IL-2 


8.8 


A 


3 9 


03^6 Tvrinriarv Th9 rp^tintr dv 4— 6 

uiiiiiaiy JLiiZf i to lii it u y *i u 

in IL-2 


2.2 


96777 PprpVipllnm ccrTYW 

A 


u.o 


03^67 •nrimarv TYi rp^tiTny dv4-fi 

7JJ\Jf [JlUllOlJf All AvOllllg Ujr *T W 

in IL-2 


3 6 


94916_NCI- 
H292_Mucoepidermoid 

InniT parrinnrria q^pTY^A 

lUUg l*all>lUUllia ociWJ^/l^l^V 


46.9 


93351_CD45RACD4 
lymphocyte_anti-CD28/anti-CD3 


19.5 


949l7_DMS-1l4_Small 
cell lung cancer_sscDNA 


10.9 


93352_CD45RO CD4 
]ymphocyte_anti-CD28/anti-CD3 


17.7 


94918_DMS-79_Sraall cell 
lung 

cancer/neuroendocrine ssc 
DNA 


100.0 


93251_CD8 Lymphocytes_anti- 


9.3 


94919_NCI-H146_Small 

cell lung 

cancer/neuroendocrine ssc 
DNA 


33.4 


93353_chronic CD8 Lymphocytes 
zry_jresiing uy *t-o in 


11.0 


94920_NCI-H526_Small 
cell lung 

cancer/neuroendocrine ssc 
DNA 


30.2 


93574_chronic CD8 Lymphocytes 
2ry_activated CD3/CD28 


5.5 


94921_NCI-N4 17_Small 
cell lung 


26.1 


93354_CD4_none 


0.7 
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cancer/neuroendocrine ssc 
DNA 








94923_NCI-H82_Small 
cell lung 

cancer/neuroendocrine ssc 
DNA 


28.4 


93252 Secondary 
Thl/Th2/Trl_anti-CD95.CH1 1 


0.9 


94924_NCI- 

H157_Squamous cell lung 
cancer 

(metastasis)_sscDNA 


88.0 


93103JLAK cells__resting 


15.8 


94925_NCl-H1155JLarge 
cell lung 

cancer/neuroendocrine ssc 
DNA 


31.3 


93788_LAK cells_IL-2 


6.1 


94926_NCI-H1 299_Large 
cell lung 

cancer/neuroendocrine ssc 
DNA 


32.4 


93787_LAK cells_IL-2+IL-12 


8.8 


94927_NCI-H727_Lung 
carcinoid sscDNA 


23.0 


93789_LAK cells_IL-2+IFN 
gamma 


11.7 


94928JNCI-UMC- 
1 l_Lung 

carcinoid sscDNA 


75.0 


93790_LAK cellsJL-2+ IL-18 


13.2 


94929_LX-l_Small cell 
lung cancer_sscDNA 


17.0 


93104JLAK 

cells_PMA/ionomycin and IL-18 


8.5 


94930_Colo-205_Colon 
cancer sscDNA 


11.1 


93578_NK Cells IL-2_resting 


2.5 


94931_KM12_Colon 
cancer sscDNA 


37.3 


93 109_Mixed Lymphocyte 
Reaction_Two Way MLR 


17.0 


94932_KM20L2_Colon 
cancer sscDNA 


7.8 


93 1 1 0_Mixed Lymphocyte 
Reaction_Two Way MLR 


10.2 


94933_NCI-H716_Colon . 
cancer^sscDNA 


32.3 


93 1 1 l_Mixed Lymphocyte 
Reaction_Two Way MLR 


7.4 


94935 jSW-4S_Coloii 
adenocarcinoma sscDNA 


7.8 


93 1 1 2_Mononuclear Cells 
(PBMCs)_resting 


2.4 


94936__SW1116_Colon 
adenocarcinoma sscDNA 


10.9 


93113 Mononuclear Cells 
(PBMCs)_PWM 


23.7 


94937_LS 174T_Colon 
adenocarcinoma sscDNA 


29.3 


93114 Mononuclear Cells 
(PBMCsLPHA-L 


9.6 


94938JSW-948_Colon 
adenocarcinoma sscDNA 


1.9 


93249_Ramos (B cell)_none 


30.4 


94939__SW-480_Colon 
adenocarcinoma sscDNA 


4.9 


93250_Ramos (B cell)_ionomycin 


100.0 


94940 NCI-WTT- 
5_Gastric 

carcinoma sscDNA 




7jj*r7 d lyuipi lucy ics r w lvi 




94941 JCATOin_Gastric 
carcinoma sscDNA 


53.1 


93350 Blymphoytes CD40Land 
1L-4 


5.5 


94943_NCI-SNU- 

in ,***:„ 


7.3 


92665_EOL-l 


11.7 
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1 \J vj uolllv 

carcinoma sscDNA 




Fn^inonhiH d he AMP 
differentiated 




94944 NCI-SMI I- 
l_Gastric 

carcinoma sscDNA 


64 4 


93248 EOL-1 

£osinophil)_dbcAMP/PMAiononi 
ycin 


6.3 


94946_RF-l_Gastric 
adenocarcinoma sscDNA 


11.4 


93356_Dcndritic Cells_jione 


12.4 


94947JtF-48_Gastric 
adenocarcinoma sscDNA 


15.4 


93355_Dendritic Cells_LPS 100 
ng/ml 


9.0 


96778_MKN-45_Gastric 
carcinoma sscDNA 


28.8 


93775_Dendritic Cells_anti-CD40 


12.5 


94949_NCI-N87J3astric 
carcinoma sscDNA 


19.5 


93774_Monocytes_jesting 


15.2 


9495 1 _0 VCAR-5_Ovarian 
carcinoma sscDNA 


11.7 


93776_Monocytes_LPS 50 ng/ml 


11.7 


94952 RL95-2 Uterine 
carcinoma sscDNA 


4.5 


93581 Macrophages resting 


41.8 


94953 HelaS3 Cervical 
adenocarcinoma sscDNA 


11.3 


93582 Macrophages LPS 100 
ng/ml 


6.8 


94954_Ca Ski_Cervical 
epidermoid carcinoma 

I iiiCLCloLaoIo / lA 


24.3 


93098JHUVEC 
(Endothelial)_none 


35.8 


94955 J3S-2_Ovarian clear 
cell carcinoma sscDNA 


16.1 


93099_HUVEC 
(Endothelial)_starved 


58.2 


94957_Ramos/6h stimj*; 
onmuiaccu wiui 
PMA/ionomycin 
6h sscDNA 


8.7 


93100_HUVEC (Endothelial)_IL- 

10 


16.5 


yHyjo_ivamos/ i*+n suin^ , 
Stimulated with 

P NT A f\ c\t\ f\tn vft i n 

14h sscDNA 


R % 

O.J 


gamma 


26 1 


myelogenous leukemia 
(megokaryoblast)_sscDNA 


22 1 


93102 HUVEC 

(Endotoelial) JTNF alpha + IFN 
gamma 


16.0 


9496^ Raii Fhirlcitt 1 ^ 
lymphomasscDNA 


8 3 

O.J 


93101 HUVEC 

y .J l v i xxv » x>w 

(Endothelial) TNF alpha + IL4 


23.2 


94964_Daudi_Burkitt , s 

1 vmTVh nrn a qccTYNA 


19.8 


93781_HUVEC (Endothelial)_IL- 


13.6 


94965_U266_B-cell 

t*1 3 Qm 5* p \ri nm a / m Vf*l nrn a cc 
L/iodiiiau v wjiiicu iiiyciuiiia so 

cDNA 


6.4 


93583 JLung Microvascular 
Fndnth filial Pells none 


21.0 


94968_CA46_Burkitt's 
l)mphoma__sscDNA 


6.5 


93584_Lung Microvascular 
Endothelial Cells TNFa (4 ng/ml) 
andlLlb (1 ng/ml) 


18.3 


94970 JlL_non-Hodgkin's 
B-cell lymphoma sscDNA 


9.0 


92662_Microvascular Dermal 
endothelium none 


35.4 


94972 JMljre-B-cell 
lymphoma/leukemia_sscD 


4.1 


92663_Microsvasular Dermal 
endothelium TNFa (4 ng/ml) and 


20.0 
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NA 




ILlb (1 ng/ml) 




94973_Jurkal_T cell 
leukemia__sscDNA 


9.8 


93773_Bronchial 

epithelium TNFa (4 ng/ml) and 

ILlb (1 ng/ml) ** 


14.9 


94974JTF- 

l_Eiythroleukemia sscDN 
A 


31.5 


93347_Small Airway 
Epitheliuni_none 


7.2 


94975_HUT78JT-cell 
lymphoma_sscDNA 


15.7 


93348_Small Airway 
Epithelium TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


45.4 


94977_U937_Histiocytic 
lymphoma_sscDNA 


30.5 


92668_Coronery Artery 
SMCjrestmg 


17.7 


94980_KU- 

8 1 2_Myelogenous 

leukemiasscDNA 


- 21.7 


92669 Coronery Artery 
SMCTNFa (4 ng/ml) and ILlb (1 
ng/ml) 


10.8 


9498 l_769-P_Clear cell 
renal carcinoma sscDNA 


16.6 


93 1 07_astrocytes_resting 


11.1 


94983_Calri-2_Clear cell 
renal carcinoma sscDNA 


16.8 


93108 astrocytes TNFa (4 ng/ml) 
and ILlb (1 ng/ml) 


10.5 


94984_SW 839_Clearcell 
renal carcinoma sscDNA 


11.8 


92666_KU-812 (Basophil)_resting 


18.8 


94986_G401_Wilms' 
tumor sscDNA 


16.5 


92667 JCU-812 
(Basophil)_PMA/ionoy cin 


27.9 


94987_Hs766T_Pancreatic 
carcinoma (LN 
metastasis)_sscDNA 


12.9 


93579_CCD1106 
(Keratinocytes)_none 


20.9 


949SS_CAPAN- 
l_Pancreatic 
adenocarcinoma (liver 
metastasis)_sscDNA 


12.0 


93580_CCD1106 

(Keratinocytes)_TNFa and IFNg 
** 


7.4 


949S9jSU86.86_Pancreati 
c carcinoma (liver 
mctastasis)_sscDNA 


32.6 


9379 l_Liver Cirrhosis 


3.7 


94990 JBxPC-3_Pancreatic 
adenocarcinoma sscDNA 


4.0 


93792 JLupus Kidney 


1.9 


94991_HPAC_Pancreatic 
adenocarcinoma sscDNA 


26.4 


93577_NCI-H292 


18.6 


94992_MIAPaCa- 
2_Pancreatic 
carcinoma sscDNA 


8.2 


93358_NCI-H292_IL^ 


33.7 


94993_CFPAC- 
l_Pancreatic ductal 
adenocarcinoma sscDNA 


30.3 


93360_NCI-H292JL-9 


36.9 


94994_PANC-l_Pancreatic 
epithelioid ductal 
carcinoma sscDNA 


26.7 


93359_NCI-H292_IL-13 


20.0 


94996_T24_Bladder 
carcinma (transitional 
cell)_sscDNA 


8.2 


93357_NCI-H292_IFN gamma 


20.4 
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94997_5637_Bladder 
carcinoma sscDNA 


20.5 


93777_H?AEC_- 


18.8 


9499S_HT-1197_Bladder 
carcinoma sscDNA 


18.0 


9377S_HPAEC_IL-1 bela/TNA 
alpha 


18.9 


94999JJM-UC-3_Bladder 
carcinma (transitional 
cell)_sscDNA 


4.6 


93254_Normal Human Lung 
Fibroblastnone 


9.5 


95000_A204_Rhabdomyos 
arcoma_sscDNA 


20.0 


93253_Normal Human Lung 
Fibroblast TNFa (4 ng/ml) and IL- 
lb (1 ng/ml) 


3.7 


95001JIT- 

1080 Fibrosarcoma sscDN 
A 


15.4 


93257_Normal Human Lung 
Fibroblast_IL-4 


24.7 


95002_MG- 

63_Osleosarcoma 

(bone)_sscDNA 


16.2 


93256_Normal Human Lung 
Fibroblast_IL-9 


19.2 


95003_SK-LMS- 
1 Leiomyosarcoma 
(vulva)sscDNA 


27.5 


93255_Normal Human Lung 
Fibroblast_IL-13 


14.3 


95004_SJRH30_Rhabdom 
yosarcoma (met to bone 
marro\v)_sscDNA 


12.5 


93258_Normal Human Lung 
FibroblastJFN gamma 


23.2 


95005_A43 l_Epidcrmoid 
carcinoma sscDNA 


6.9 


93106_Dermal Fibroblasts 
CCD1070 resting 


47.0 


95007JWM266- 

4 Melanoma sscDNA 


7.5 


93361_Dermal Fibroblasts 
CCD1070JTNF alpha 4 ng/ml 


42.3 


95010J3U 145_Prostate 
carcinoma (brain 
metastasis)_sscDNA 


0.1 


93105_Dermal Fibroblasts 
CCD1070JL-1 beta 1 ng/ml 


20.2 


95012_MDA-MB- 
468_Breast 

adenocarcinoma sscDNA 


13.5 


93772_dermal fibroblastJFN 
gamma 


9.2 


950 1 3_SCXM_Squamoiis 
cell carcinoma of 
tongue_sscDNA 


1.3 


9377 l_dermal fibroblast JL-4 


22.1 


95014_SCC-9_Squamous 
cell carcinoma of 
tonguesscDNA 


0.3 


93259 JBD Colitis 1** 


1.4 


950 1 5_SCC-15_Squamous 
cell carcinoma of 
tongue sscDNA 


0.3 


93260 JBD Colitis 2 


1.1 


95017_CAL 27_Squamous 
cell carcinoma of 
tongue_sscDNA 


24.5 


93261 JBD Crohns 


1.0 






735010_Colon_normal 


4.1 






735019JLung^none 


8.9 






64028-1 JThymus_none 


10.5 
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64030- l_Kidney_none 


3.6 



Pane! J. 3D description: Hie gene MOL9b is expressed in a number of tissues, including the central 
nervous system, lung, mammary gland and kidney. Moreover, its expression seetns to be enhanced in 
tumor cell lines relative to normal tissue in most cases with a good therapeutic window. 
5 Panel 2D description: Tissue distribution of tlie gene MOL9b in panel 2D confirms tlie results obtained in 
panel 1.3 D. Tliere is enhanced expression of this gene (MOL9b) in tumor tissue relative to normal 
adjacent tissue, particularly in lung and kidney cancers, but also in cancers of colon, ovary, breast, 
gastric, bladder, liver and thyroid. Some metastases, particularly lung metastases and those fi om 
melanoma express this gene MOL9h at particularly high levels. Coiroborative information about the 
10 expression of this molecule is available in the form of ESTs, mostly from endotlielial cells, colon, ovarian 
tumors, pancreas and brain regions. Panel 3D for the gene MOL9b shows high level of expression in a 
variety of carcinomas, supporting results from panels L3D and 2D, and demonstrating utility for this 
protein as an antibody target. Tlierefore antibodies specific to this protein may be used as a therapeutic 
in the treatment of various types of cancer. 

15 

Panel 4D Description: The gene MOL9b is upregulated in endothelium, and epithelium 
regardless of stimulus. There is up regulation of this molecule in an ionomycin treated B cell 
line and it is highly expressed in mitogen (PWM) treated B cells. Consistent with this finding, 
PBMCs treated with PWM also up regulate this molecule. Induction of this gene MOL9b is also 
20 seen in activated T cells. In PBMC the T cell specific mitogen PHA induces the expression of 
this transcript and in acute and chronically activated T cells the expression of this transcript is 
increased as compared to untreated or resting T cells. This transcript is also expressed in resting 
macrophages. 

Potential Rolc(s) of in Inflammation: The molecule MOL9b is induced by B and T 
25 lymphocytes and may potentiate inflammation by regulating lymphocyte trafficking, or 
activation, or increasing tissue destruction. This molecule may also serve as a marker for 
activated T or B cells and may also be involved in the differentiation of monocytes into 
macrophages (see reference). Macrophages also participate in inflammation by producing . 
multiple biologically active proteins like cytokines, activating other cells within the local 
30 microenvironment, and ingesting dead and dying cells. 

Impact of Therapeutic Targeting of MOL9b: Small molecule or antibody therapies to the 
molecule encoded for by MOL9b may reduce or eliminate inflammation and tissue damage due 
to T or B cell activation or macrophages and the bioactive molecules produced by these cell 
types. These diseases would include asthma/allergy, colitis, Crohn's disease, lupus, and arthritis. 
35 Alternatively, protein therapeutics based on this molecule could serve as an adjuvant and help 
boost the effectiveness of vaccines or regulate immune status during organ transplant. MOL9b 
also serves as a marker for activated T and B cells and serve as a diagnostic tool for determining 
indirectly measuring the extent of inflammation due to autoimmune diseases which induce T or 
B cells activation such as asthma/allergy, colitis, Crohn's disease, lupus, and arthritis. Elevated 
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expression of the gene MOL9b in macrophages may help in distinguishing resting macrophages 
from monocytes , dendritic cells. 

MOLlOa 

Expression of gene MOLlOa was assessed using the primer-probe set Agl 129, described 
5 in Table 35. Results of the RTQ-PCR runs are shown in Table 36. 



Table 35. Probe name: Agll29 



Primers 


Sequences 


Tm 


Lengt 
h 


Start 
Position 


SEQID 
NO: 


Forward 


S'-CTAGACCAGCAGCTGGATGAT^ 1 


59.5 


21 


1518 


103 


Probe 


TET-5'- 

CTACAGACCAAGTTTGCTCGCCTCCT- 
3'-TAMRA 


68.7 


26 


1539 


104 


Reverse 


S^CAATGCGGTAAGCAATCTTAAG-B 1 


59 


22 


1587 


105 



10 Table 36. Panels 1.3D and 4D 



PANEL 1.3D 




PANEL 4D 




Tissue Name 


Rel. Expr., 
% 

1.3dx4tm57 
71 1 agl 129 
a2 


Tissue Name 


Rel. Expr., 
% 

4Dtml984t 
_agll29 


Liver 

adenocarcinoma 


3.9 


93768 Secondary Thl anti-CD28/anti- 
CD3 


0.0 


Pancreas 


3.7 


93769 Secondary Th2jmti-CD28/anti- 
CD3 


9:0 


Pancreatic ca. 
CAP AN 2 


2.8 


93770 Secondary Trl_anti-€D28/anti- 
CD3 


9.0 


Adrenal gland 


1.8 


93573_Secondary Thlresting day 4-6 
inIL-2 


0.0 


Thyroid 


3.2 


93572 Secondary Th2_resting day 4-6 
inIL-2 


0.0 


Salivary gland 


3.8 


93571 Secondary Trl resting day 4-6 in 
0,-2 


0.0 


Pituitary gland 


2.6 


93568_primary Thl anti-CD28/anti- 
CD3 


0.0 


Brain (fetal) 


21.9 


93569_primary Th2_anti-CD28/anti- 
CD3 


0.0 


Brain (whole) 


14.6 


93570_primary Trl_anti-CD28/anti- 
CD3 


8.9 
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Brain (amygdala) 


13.0 


93565_primary Thl resting dy 4-6 in 
IL-2 


6.8 


Brain (cerebellum) 


1.8 


93566_primary Th2 resting dy 4-6 in 
EL-2 


14.6 


Brain (hippocampus) 


13.7 


93567_prirnary Trl_resting dy 4-6 in IL- 

2 


0.0 


Brain (substantia 
nigra) 


21.4 


93351_CD45RA CD4 lymphocyte_anti- 
CD28/anti-CD3 


5.9 


Brain (thalamus) 


10.3 


93352 CD45RO CD4 lymphocyte_anti- 
CD28/anti-CD3 


14.3 


Cerebral Cortex 


10.5 


93251_CD8 Lymphocytes_anti- 
CD28/anti-CD3 


33.0 


Spinal cord 


8.1 


93353_chronic CDS Lymphocytes 
2ry_resting dy 4-6 in IL-2 


4.2 


CNS ca. (glio/astro) 
U87-MG 


6.0 


93574_chronic CD8 Lymphocytes 
2ry_activated CD3/CD28 


0.0 


CNSca. (glio/astro) 
U-11S-MG 


0.0 


93354_CD4_none 


1.4 


CNS ca. (astro) 
SW1783 


3.5 


93252 Secondary ThlyTh2/Trl anti- 
CD95CH11 


20.2 


CNS ca.* (neuro; met 
) SK-N-AS 


0.0 


93 103_LAK cells_resting 


8.7 


CNS ca. (astro) 
SF-539 


1.7 


93788 JLAKcells_IL-2 


30.8 


CNS ca. (astro) 
SNB-75 


0.0 


93787_LAK cells JL-24-IL-12 


15.6 


CNS ca. (glio) 
SNB-19 


0.9 


93789_LAK cells_IL-2+IFN gamma 


53.2 


CNS ca. (glio) 
U251 


1.4 


93790JLAK cellsJL-2+ IL-18 


20.6 


CNS ca. (glio) 
SF-295 


0.0 


93104_LAK cells_PMA/ionomycin and 
IL-18 


0.0 


Heart (fetal) 


0.0 


93578_NK Cells EL-2jresting 


- 5.9 


Heart 


3.3 


93109_Mixed Lymphocyte 
Reaction_Two Way MLR 


12.9 


Fetal Skeletal 


2.0 


93 1 1 0_Mixed Lymphocyte 
Reaction, Two Way MLR 


3.5 


Skeletal muscle 


2.3 


93 1 1 l_Mixed Lymphocyte 
Reaction_Two Way MLR 


6.8 


Bone marrow 


4.0 


931 12_Mononuclear Cells 
(PBMCs) resting 


17.2 


Thymus 


3.9 


931 13_Mononuclear Cells 
(PBMCs) PWM 


32.5 


Spleen 


0.0 


93114 Mononuclear Cells 
(PBMCs)_PHA-L 


20.3 


Lymph node 


1.7 


93249_Ramos (B cell)_none 


22.2 


Colorectal 


1.4 


93250_Ramos (B cell)_ionomycin 


68.3 
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Stomach 


0.0 ! 


>3349_B lymphocytes_PWM 


33.7 


Small intestine 


0.0 ! 


5>3350_B lymphoytes_CD40L and IL-4 


9.4 


Colon ca. 
SW480 


1.5 ' 


?2665_EOL-l (Eosinophil)_dbcAMP 
iifferentiated 


15.3 


Colon ca.*(SW480 
met)SW620 


2.0 


«248_EOL-l 

[Eosinophil) dbcAMP/PMAionomycin 


0.0 


Colon ca. 
HT29 


2.2 


?3356_Dendritic Cellsjione 


0.0 


Colon ca. 
HCT-116 


3.7 


93355_Dendritic Cells_LPS lOOng/ml 


6.7 


Colon ca. 
CaCo-2 


0.6 


93775_Dendritic Cells_anti-CD40 


0.0 


83219 CC Well to 
Mod Diff (OD03866) 


5.2 


93774_Monocytes_resting 


20.6 


Colon ca. 
HCC-2998 


5.1 


93776_Monocytes_LPS 50 ng/ml 


15.8 


Gastric ca.* (liver 
met)NCI-N87 


1 A fk 
1U.U 


y^Doi iviacropnagcs__reaoug 




Bladder 


4.6 


93582_Macrophages_LPS 100 ng/ml 


64.2 


Trachea 




yjKJyo rlU VCVv t ErnuOulcilalJ I1UI1C 




Kidney 


3.3 


93099JJUVEC (Endothelial)_starved 


0.0 


Kidney (fetal) 


7.3 


93100_HUVEC (Endothelial)JL-lb 


3.5 


Renal ca. 
786-0 


0.0 


93779JIUVEC (Endothelia1)JFN 
gamma 


6.7 


Renal ca. 
A498 


0.0 


93102_IIUVEC (Endothelial) JINF 
alpha + 1FN gamma 


0.0 


Renal ca. 
RXF393 


4.4 


93101_HUVEC (Endothelial)_TNF 
alpha + IL4 


8.1 


Renal ca 
ACHN 


0.0 


9378 1J3UVEC (Endothelial) JL- 11 


0.0 


Renal ca 

UO-31 


0.0 


93583 Lung Microvascular Endothelial 
Cells none 


24.0 


Renal ca. 
TK-10 


5.9 


93584_Lung Microvascular Endothelial 
Cells TNFa(4ng/ml)andILlb(l 
ng/ml) 


17.8 


Liver 


3.4 


92662 Microvascular Dermal 
endothelium none 


5.2 


Liver (fetal) 


0.0 


92663_Microsvasular Dermal 
endothelium_TNFa (4 ng/ml) and ILlb 
W ng/mij 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


93773 Bronchial epithelium_TNFa (4 
ng/ml) and ILlb (1 ngfail) ** 


79.6 


Lung 


30.2 


93347_Small Airway Epithelium__none 


0.0 


Lung (fetal) 


100.0 


93348_SmalI Airway EpitheliumJTNFa 
(4 ng/ml) and ILlb (1 ng/ml) 


53.6 
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Lung ca. (small cell) 
LX-1 


1.9 


92668_Coronery Artery SMCjresting 


0.0 


Lung ca. (small cell) 
NCI-H69 


9.8 


92669_Coronery Artery SMC TNFa (4 
ng/ml) and ILlb (1 ng/ml) 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


9.5 


93 1 07_astrocytes_resting 


0.0 


Lung ca. (large 
cell)NTCI-H460 


0.0 


93108 astrocytes TNFa (4 ng/ml) and 
ILlb (1 ng/ml) 


19.6 


Lung ca. (non-sm. 
cell) A549 


1.5 


92666JCU-812 (Basophil)_resting 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


1.8 


92667_KU-812 
(Basophil)_PMA/ionoycin 


15.1 


Lung ca (non-s.cell) 
HOP-62 


0.8 


93579_CCD1 106 (Keratinocytes)_none 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


8.7 


93580_CCD1106 

(Keratinocytes)_TNFa and IFNg ** 


4.7 


Lung ca. (squam.) 
SW 900 


1.8 


9379 l_Liver Cirrhosis 


47.6 


Lung ca. (squam.) 
NCI-H596 


6.7 


93792_Lupus Kidney 


25.0 


Mammary gland 


8.0 


93577_NCI-H292 


17.0 


Breast ca * (pi. 
effusion) MCF-7 


0.0 


93358_NCI-H292_IL-4 


4.8 


Breast ca* (pl.ef) 
MDA-MB-231 


0.0 


93360_NCI-H292_IL-9 


12.9 


Breast ca * (pi. 
effusion) T47D 


0.0 


93359_NCI-H292_IL-13 


0.0 


Breast ca. 
BT-549 


0.0 


93357_NCI-H292_EFN gamma 


4.4 


Breast ca. 
MDA-N 


0.0 


93777_HPAEC_- 


0.0 


Ovaiy 


2.3 


93778_HPAEC_IL-1 beta/TNA alpha 


6.9 


Ovarian ca. 
OVCAR-3 


0.0 


93254_Normal Human Lung 
Fibroblast_none 


0.0 


Ovarian ca. 
OVCAR-4 


1.2 


93253_Normal Human Lung 
FibroblastJTNFa (4 ng/ml) and IL-lb (1 
ng/ml) 


6.2 


Ovarian ca. 
OVCAR-5 


7.7 


93257_Normal Human Lung 
Fibroblast IL-4 


6.9 

i 


Ovarian ca. 
OVCAR-8 


0.0 


93256_Normal Human Lung 
Fibroblast IL-9 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


93255_Normal Human Lung 
Fibroblast IL-13 


0.0 


Ovarian ca.* (ascites) 
SK-OV-3 


0.0 


93258_Normal Human Lung 
Fibroblast_IFN gamma 


0.0 


Uterus 


1.9 


93 106_Dermal Fibroblasts 
CCD1070_resting 


12.9 
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Plancenta 


0.0 


93361_Dermal Fibroblasts 
(TD1070 TNF alnba 4 ne/ml 


28.5 


Prostate 


4.9 


93105JDermal Fibroblasts 
PPD1070 TL-1 beta 1 ne/ml 


0.0 


Prostate ca.* (bone 


4.4 


93772_dcrmal fibroblast_IFN gamma 


9.8 


Testis 


57.0 


93771_dennal fibroblast JL-4 


2.6 


Melanoma 
Hs688(A).T 


0.0 


93259JBD Colitis 1** 


88.3 


Melanoma* (met) 
Hs688(B).T 


0.0 


93260 JBD Colitis 2 


0.0 


Melanoma 
UACC-62 


0.0 


9326 1JBD Crohns 


0.0 


Melanoma 
M14 


0.0 


7350 1 0_Colon_normal 


20.2 


Melanoma 
LOXIMV1 


3.6 


735019_Lung^none 


29.7 


Melanoma* (met) 
SK-MEL-5 


3.3 


64028- lJThymus_none 


100.0 


Adipose 


3.2 


64030- l_Kidney_none 


10.0 



The gene MOLlOa shows high levels in the testis and in fetal lung in panel 1.3D. This 
indicates that this gene may be used for regeneration therapy in the lung and may also play a role 
in male fertility. The profile in panel 4D shows high expression in thymus with low to 
5 undetectable expression in other tissues and cell lines (Ct values >35). Therefore this gene may 
be involved in T-cell development and may be a marker for immature T cells. 

EXAMPLE 2: SNP ANALYSIS OF MOL6A 

Variant sequences are included in this application. A variant sequence can include a 
10 single nucleotide polymorphism (SNP). A SNP can, in some instances, be referred to as a 

"cSNP 11 to denote that the nucleotide sequence containing the SNP originates as a cDNA. A SNP 
can arise in several ways. For example, a SNP may be due to a substitution of one nucleotide for 
another at the polymorphic site. Such a substitution can be either a transition or a transversion. A 
SNP can also arise from a deletion of a nucleotide or an insertion of a nucleotide, relative to a 
1 5 reference allele. In this case, the polymorphic site is a site at which one allele bears a gap with 
respect to a particular nucleotide in another allele. SNPs occurring within genes may result in an 
alteration of the amino acid encoded by the gene at the position of the SNP. Intragenic SNPs 
may also be silent, however, in the case that a codon including a SNP encodes the same amino 
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acid as a result of the redundancy of the genetic code. SNPs occurring outside the region of a 
gene, or m an intron within a gene, do not result in changes in any amino acid sequence of a 
protein but may result in altered regulation of the expression pattern for example, alteration in 
temporal expression, physiological response regulation, cell type expression regulation, intensity 
5 of expression, stability of transcribed message. 

Method of novel SNP Identification: SNPs are identified by analyzing sequence 
assemblies using CuraGen's proprietary SNPTool algorithm. SNPTool identifies variation in 
assemblies with the following criteria: SNPs are not analyzed within 1 0 base pairs on both ends 
of an alignment; Window size (number of bases in a view) is 10; The allowed number of 

10 mismatches in a window is 2; Minimum SNP base quality (PHRED score) is 23; Minimum 

number of changes to score an SNP is 2/assembly position. SNPTool analyzes the assembly and 
displays SNP positions, associated individual variant sequences in the assembly, the depth of the 
assembly at that given position, the putative assembly allele frequency, and the SNP sequence 
variation. Sequence traces are then selected and brought into view for manual validation. The 

1 5 consensus assembly sequence is imported into CuraTools along with variant sequence changes to 
identify potential amino acid changes resulting from the SNP sequence variation. 
Comprehensive SNP data analysis is then exported into the SNPCalling database. 

Method of novel SNP Confirmation: SNPs are confirmed employing a validated 
method know as Pyrosequencing (See Alderborn et al. Determination of Single Nucleotide 

20 Polymorphisms by Real-time Pyrophosphate DNA Sequencing. (2000). Genome Research. 10, 
Issue 8, August. 1249-1265). In brief, Pyrosequencing is a real time primer extension process of 
* genotyping. This protocol takes double-stranded, biotinylated PCR products from genomic 
DNA samples and binds them to streptavidin beads. These beads are then denatured producing 
single stranded bound DNA. SNPs are characterized utilizing a technique based on an indirect 

25 bioluminometric assay of pyrophosphate (PPj) that is released from each dNTP upon DNA chain 
elongation. Following Klenow polymerase-mediated base incorporation, PPj is released and used 
as a substrate, together with adenosine 5-phosphosulfate (APS), for ATP sulfurylase, which 
results in the formation of ATP. Subsequently, the ATP accomplishes the conversion of luciferin 
to its oxi-derivative by the action of luciferase. The ensuing light output becomes proportional to 

30 the number of added bases, up to about four bases. To allow processivity of the method dNTP 
excess is degraded by apyrase, which is also present in the starting reaction mixture, so that only 
dNTPs are added to the template during the sequencing. The process has been fully automated 
and adapted to a 96-well format, which allows rapid screening of large SNP panels. 
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The DNA and protein sequences for the novel single nucleotide polymorphic variants of 
the Trypsin -like gene of MOL6a (GM87760758_A) are reported in Table 37. Variants are 
reported individually but any combination of all or a select subset of variants are also included. 
In Table 37, the positions of the variant bases and the variant amino acid residues are underlined. 
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Tabic 37. 

A. Variant 13373750 of MOL6a nucleotide sequence (SEQ ID NO. 11). 

C toA at position 360. 

B. Nucleotide sequence of variant at position 360. 

5 1TACCATGAAATATGTCTT CTATTTGGGTG TC CTCGCTGGGACATTTTT CITTGCTGACT CATCTGTC CAG AAAGAAGAC C 

81CTGCTCCCTATTTG3TGTACCTCAAGTCTOVCTTCAACCCCTGTGTGGGCGTCCT 
1 6 1C CAGCTCACTG CTATTTACCAAATCTGAAAGTGATG CTGGGAAATTTCAAG AGCAGAGTCAGAGACG^TACTGAACAG AC 

2 4 lAATTAACCCCATTCAGATCGTCCGCTACTGGAACTACAGTCArAGCGCCCCACA 

3 2 1CTAAGCCTGCCATGCTCAATCCCAAAGTCCAGCCC 

10 4 01CTCTCAGGTTTGGACTGGAGCCAAGAAAACAGTGGCCGACACC^^ 

4 8 1TGATCGAGAATG C CAAAAAAACAGAA CAAGGAAAAAG C CACAGG AATT CCTT ATGT GTGAAATTTGT GAAAGTATT CAG C 

5 6 lCGAATTTTTGGGGAGGTGGCCGTTGCTACrTCTCAT 

6 4 1AGG^GACGTCGGCATCTAGA.CCAATG 
72LA (SEQ ID NO. 106) 

15 

C. Protein Sequence of variant at position 119. 

1MKYVFYL(WLAGTFFFADSSVQKEDPAPYLI^ 

8 1NPIQ IVRYVmSHSAPQDDLMLI KIiAKPAMLNPKVQPLTIJlTTNVRPGTVCliLSGLDWS^ SD 
1 6 1RECOKNRTRKKPQEFLMCE I CES I QPNFWGGGRC YCHLQRQAPGNRGGALHGRGRRHLHQCLQI CILD ( SEQ ID 
20 NO. 107) 



D. Effect of variant on amino acid residue 
Pro to Thr 



25 



EQUIVALENTS 

Although particular embodiments have been disclosed herein in detail, this has been done 
by way of example for purposes of illustration only, and is not intended to be limiting with 
respect to the scope of the appended claims, which follow. In particular, it is contemplated by 

30 the inventors that various substitutions, alterations, and modifications may be made to the 

invention without departing from the spirit and scope of the invention as defined by the claims. 
The choice of nucleic acid starting material, clone of interest, or library type is believed to be a 
matter of routine for a person of ordinary skill in the art with knowledge of the embodiments 
described herein. Other aspects, advantages, and modifications considered to be within the scope 

35 of the following claims. 
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WHAT IS CLAIMED IS: 

1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of 
SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; 

(b) a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ IDNOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 
30, wherein one or more amino acid residues in said variant differs from the 
amino acid sequence of said mature form, provided that said variant differs in no 
more than 15% of the amino acid residues from the amino acid sequence of said 
mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID NOS:2, 4, 
6, S, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; and 

(d) a variant of an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, wherein one or more 
amino acid residues in said variant differs from the amino acid sequence of said 
mature form, provided that said variant differs in no more than 1 5% of amino acid 
residues from said amino acid sequence. 

2 The polypeptide of claim 1, wherein said polypeptide comprises the amino acid sequence 
of a naturallyK>ccurring allelic variant <5f an amino acid sequence selected from the group 
consisting of SEQ ED NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30. 

3. The polypeptide of claim 2, wherein said allelic variant comprises an amino acid 
sequence that is the translation of a nucleic acid sequence differing by a single nucleotide 
from a nucleic acid sequence selected from the group consisting of SEQ ID NOS:l, 3, 5, 
7,9, 11, 13, 15,17, 19, 21, 23, 25, 27, and 29. 

4. The polypeptide of claim 1, wherein the amino acid sequence of said variant comprises a 
conservative amino acid substitution. 
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5. An isolated nucleic acid molecule comprising a nucleic acid sequence encoding a 
polypeptide comprising an amino acid sequence selected from the group consisting of: 

(a) a mature form of an amino acid sequence selected from the group consisting of 
SEQ ED NOS:2 3 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; 

(b) a variant of a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 
30, wherein one or more amino acid residues in said variant differs from the 
amino acid sequence of said mature form, provided that said variant differs in no 
more than 15% of the amino acid residues from the amino acid sequence of said 
mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID NOS:2, 4, 
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; 

(d) a variant of an amino acid sequence selected from the group consisting SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, wherein one or more 
amino acid residues in said variant differs from the amino acid sequence of said 
mature form, provided that said variant differs in no more than 15% of amino acid 
residues from said amino acid sequence; 

(e) a nucleic acid fragment encoding at least a portion of a polypeptide comprising an 
amino acid sequence chosen from the group consisting of SEQ IDNOS:2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, or a variant of said polypeptide, 
wherein one or more amino acid residues in said variant differs from the amino 
acid sequence of said mature form, provided that said variant differs in no more 
than 15% of amino acid residues from said amino acid sequence; and 

(f) a nucleic acid molecule comprising the complement of (a), (b), (c), (d) or (e). 

6. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises the 
nucleotide sequence of a naturally-occurring allelic nucleic acid variant 

7. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule encodes a 
polypeptide comprising the amino acid sequence of a naturally-occurring polypeptide 
variant. 
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8. The nucleic acid molecule of claim 5, wherein the nucleic acid molecule differs by a 
single nucleotide from a nucleic acid sequence selected from the group consisting of SEQ 
ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, and 29. 

9. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence selected from the group consisting of SEQ ID NOS: 1,3,5, 
7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, and 29; 

(b) a nucleotide sequence differing by one or more nucleotides from a nucleotide 
sequence selected from the group consisting of SEQ ID NOS: 1, 3, 5, 7, 9, 1 1, 13, 
15, 17, 19, 21, 23, 25, 27, and 29, provided that no more than 20% of the 
nucleotides differ from said nucleotide sequence; 

(c) a nucleic acid fragment of (a); and 

(d) a nucleic acid fragment of (b). 

1 0. The nucleic acid molecule of claim 5, wherein said nucleic acid molecule hybridizes 
under stringent conditions to a nucleotide sequence chosen from the group consisting of 
SEQ ID NOS:l, 3, 5, 7, 9, 1 3, 13, 15, 17, 19, 21, 23, 25, 27, and 29, or a complement of 
said nucleotide sequence. 

1 1 . The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of: 

(a) a first nucleotide sequence comprising a coding sequence differing by one or 
more nucleotide sequences from a coding sequence encoding said amino acid 
sequence, provided that no more than 20% of the nucleotides in the coding 
sequence in said first nucleotide sequence differ from said coding sequence; 

(b) an isolated second polynucleotide that is a complement of the first polynucleotide; 
and 

(c) a nucleic acid fragment of (a) or (b). 

12. A vector comprising the nucleic acid molecule of claim 1 1 . 

13. The vector of claim 12, further comprising a promoter operably-linked to said nucleic 
acid molecule. 
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14. A cell comprising the vector of claim 12. 

1 5. An antibody that binds immunospecifically to the polypeptide of claim 1 . 

1 6. The antibody of claim 1 5, wherein said antibody is a monoclonal antibody. 

1 7. The antibody of claim 1 5, wherein the antibody is a humanized antibody. 

1 8. A method for determining the presence or amount of the polypeptide of claim 1 in a 
sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with an antibody that binds immunospecifically to the 
polypeptide; and 

(c) determining the presence or amount of antibody bound to said polypeptide, 
thereby determining the presence or amount of polypeptide in said sample. 



1 9. A method for determining the presence or amount of the nucleic acid molecule of claim 5 
in a sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with a probe that binds to said nucleic acid molecule; and 

(c) determining the presence or amount of the probe bound to said nucleic acid 
molecule, 

thereby determining the presence or amount of the nucleic acid molecule in said sample. 

20. The method of claim 19 wherein presence or amount of the nucleic acid molecule is used 
as a marker for cell or tissue type. 

21. The method of claim 20 wherein the cell or tissue type is cancerous. 

22. A method of identifying an agent that binds to a polypeptide of claim 1, the method 
comprising: 

(a) contacting said polypeptide with said agent; and 

(b) determining whether said agent binds to said polypeptide. 
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23 . The method of claim 22 wherein the agent is a cellular receptor or a downstream effector. 

24. A method for identifying an agent that modulates the expression or activity of the 
polypeptide of claim 1, the method comprising: 

(a) providing a cell expressing said polypeptide; 

(b) contacting the cell with said agent, and 

(c) determining whether the agent modulates expression or activity of said 
polypeptide, 

whereby an alteration in expression or activity of said peptide indicates said agent modulates 
expression or activity of said polypeptide. 

25. A method for modulating the activity of the polypeptide of claim 1 , the method 
comprising contacting a cell sample expressing the polypeptide of said claim with a 
compound that binds to said polypeptide in an amount sufficient to modulate the activity 
of the polypeptide. 

26. A method of treating or preventing a MOLX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the 
polypeptide of claim 1 in an amount sufficient to treat or prevent said MOLX-associated 
disorder in said subject. 

27. The method of claim 26 wherein the disorder is selected from the group consisting of 
cardiomyopathy and atherosclerosis. 

28. The method of claim 26 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

29. The method of claim 26, wherein said subject is a human. 

30. A method of treating or preventing a MOLX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the nucleic 
acid of claim 5 in an amount sufficient to treat or prevent said MOLX-associated disorder 
in said subject. 
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31. The method of claim 30 wherein the disorder is selected from the group consisting of 
cardiomyopathy and atherosclerosis. 

32. The method of claim 30 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

33. The method of claim 30, wherein said subject is a human. 

34. A method of treating or preventing a MOLX-associated disorder, said method comprising 
administering to a subject in which such treatment or prevention is desired the antibody 
of claim 15 in an amount sufficient to treat or prevent said MOLX-associated disorder in 
said subject 

35. TTie method of claim 34 wherein the disorder is diabetes. 

36. The method of claim 34 wherein the disorder is related to cell signal processing and 
metabolic pathway modulation. 

37. The method of claim 34, wherein the subject is a human. 

38. A pharmaceutical composition comprising the polypeptide of claim 1 and a 
pharmaceutically-acceptable carrier. 

39. A pharmaceutical composition comprising the nucleic acid molecule of claim 5 and a 
pharmaceutically-acceptable carrier. 

40. A pharmaceutical composition comprising the antibody of claim 1 5 and a 
pharmaceutically-acceptable earner. 

41. A kit comprising in one or more containers, the pharmaceutical composition of claim 38. 

42. A kit comprising in one or more containers, the pharmaceutical composition of claim 39. 
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43. A kit comprising in one or more containers, the pharmaceutical composition of claim 40. 

44. A method for determining the presence of or predisposition to a disease associated with 
altered levels of the polypeptide of claim 1 in a first mammalian subject, the method 
comprising: 

(a) measuring the level of expression of the polypeptide in a sample from the first 
mammalian subject; and 

(b) comparing the amount of said polypeptide in the sample of step (a) to the amount 
of the polypeptide present in a control sample from a second mammalian subject 
known not to have, or not to be predisposed to, said disease; 

wherein an alteration in the expression level of the polypeptide in the first subject as compared to 
the control sample indicates the presence of or predisposition to said disease. ) 

45. The method of claim 44 wherein the predisposition is to cancers. 

46. A method for determining the presence of or predisposition to a disease associated with 
altered levels of the nucleic acid molecule of claim 5 in a first mammalian subject, the 
method comprising: 

(a) measuring the amount of the nucleic acid in a sample from the first mammalian 
subject; and 

(b) comparing the amount of said nucleic acid in the sample of step (a) to the amount 
of the nucleic acid present in a control sample from a second mammalian subject 
known not to have or not be predisposed to, the disease; 

wherein an alteration in the level of the nucleic acid in the first subject as compared to the 
control sample indicates the presence of or predisposition to the disease. 

47. The method of claim 46 wherein the predisposition is to a cancer. 

48. A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal a polypeptide in an amount that is sufficient to alleviate the 
pathological state, wherein the polypeptide is a polypeptide having an amino acid 
sequence at least 95% identical to a polypeptide comprising an amino acid sequence of at 
leastoneofSEQIDNOS:2, 4,6, 8, 10, 12, 14, 16, 18, 20,22,24, 26, 28, and 30, ora 
biologically active fragment thereof. 
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A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal the antibody of claim 15 in an amount sufficient to alleviate 
the pathological state. 

A method for the screening of a candidate substance interacting with an olfactory 
receptor polypeptide selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 
12, 14, 16, 18, 20, 22, 24, 26, 28, and 30, or fragments or variants thereof, comprises the 
following steps: 

a) providing a polypeptide selected from the group consisting of the 
sequences of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 
30, or a peptide fragment or a variant thereof; 

b) obtaining a candidate substance; 

c) bringing into contact said polypeptide with said candidate substance; and 

d) detecting the complexes formed between said polypeptide and said 
candidate substance. 

A method for the screening of ligand molecules interacting with an olfactory receptor 
polypeptide selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, and 30, wherein said method comprises: 

a) providing a recombinant eukaryotic host cell containing a nucleic acid 
encoding a polypeptide selected from the group consisting of the polypeptides 
comprising the amino acid sequences SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 
20, 22, 24, 26, 28, and 30; 

b) preparing membrane extracts of said recombinant eukaryotic host cell; 

c) bringing into contact the membrane extracts prepared at step b) with a 
selected ligand molecule; and 

d) detecting the production level of second messengers metabolites. 

A method for the screening of ligand molecules interacting with an olfactory receptor 
polypeptide selected from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
1 8, 20, 22, 24, 26, 28, and 30, wherein said method comprises: 
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a) providing an adenovirus containing a nucleic acid encoding a polypeptide 
selected from the group consisting of polypeptides comprising the amino acid 
sequences SEQIDN0S:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, and 30; 

b) infecting an olfactory epithelium with said adenovirus; 

c) bringing into contact the olfactory epithelium b) with a selected ligand 
molecule; and 

d) detecting the increase of the response to said ligand molecule. 
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